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Abstract 

Open, type-level functions are a recent innovation in Haskell that 
move Haskell towards the expressiveness of dependent types, while 
retaining the look and feel of a practical programming language. 
This paper shows how to increase expressiveness still further, by 
adding closed type functions whose equations may overlap, and 
may have non-linear patterns over an open type universe. Although 
practically useful and simple to implement, these features go be¬ 
yond conventional dependent type theory in some respects, and 
have a subtle metatheory. 

Categories and Subject Descriptors F.3.3 [Logics and Mean¬ 
ings of Programs]: Studies of Program Constructs—type structure; 
D.3.3 [Programming Languages]: Language Constructs and Fea¬ 
tures; F.4.2 [Mathematical Logic and Formal Languages]: Gram¬ 
mars and Other Rewriting Systems—parallel rewriting systems 

General Terms Design, Languages, Theory 

Keywords Type families; Type-level computation; Haskell; Sys¬ 
tem FC 

1. Introduction 

Type families are a relatively recent extension to Haskell that allows 
the programmer to express type-level computation (Chakravarty 
et al. 2005). For example, one can say 
type family Elt (a :: *) :: * 
type instance Elt ByteString = Word8 
type instance Elt [b] = b 

The first line declares the type family Elt and gives its kind; the 
second and third are two independent declarations that give two 
equations for Elt. Now the types ( Elt ByteString) and Word8 are 
considered equivalent by the type inference engine, and likewise 
(Elt [ Int]) and Int. Type families have proved to be a popular 
feature in Haskell, dovetailing particularly nicely with Haskell’s 
type classes. Type families are naturally partial and open. For 
example, there is no equation for Elt Char above, so Elt Char 
will never be equal to any other type. On the other hand, the author 
of a new library is free to add a new instance, such as this one: 
type instance Elt (Set b) = b 

However, not all type-level functions can be defined by open type 
families. An important example is the equality function, which 
determines whether two types can be shown equal at compile- 


1 Here we use datatype promotion , allowing data types like Bool , and lists, 
to be used as kinds (Yorgey et al. 2012). 


type family Equal a b :: Bool 

type instance Equal a a = True — Instance (A) 

type instance Equal a b = False — Instance (B) 

The programmer intends these equations to be read top-to-bottom, 
like a term-level function definition in Haskell. However, because 
GHC’s current type families are open, they must be defined by inde¬ 
pendent, un-ordered type instance equations. The two equations 
overlap, so they are rightly rejected lest they be used to deduce 
unsound type equalities. For example, we could reduce the type 
Equal Int Int to both True and False, since both patterns match. 

Yet equality is a well-defined function, and a useful one too, as 
we discuss in Section 2. To fix this omission we introduce closed 
type families with ordered equations, thus: 
type family Equal a b :: Bool where 
Equal a a = True 
Equal a b = False 

Now all the equations for the type family are given together, and 
can be read top-to-bottom. However, behind this simple idea lie 
a number of complexities. In this paper we describe these pitfalls 
and their sometimes non-obvious solutions. We make the following 
contributions: 

• We introduce closed type families with overlapping equations, 
and show how they can readily express programs that were 
previously inexpressible or required indirect encodings (Sec¬ 
tion 2). 

• Our system supports non-linear left-hand sides, such as that for 
Equal above, where the variable a is repeated in the first equa¬ 
tion. It also supports coincident overlap, which allows some 
lightweight theorem-proving capability to be incorporated in 
the definitional equality of types (Section 3.4). 

• We give the subtle rules that govern type family simplifica¬ 
tion, including those that determine when a pattern cannot be 
matched by a type (Section 3). 

• We describe a typed core language that includes both open and 
closed type families (Section 4), and prove that it is type-safe, 
assuming that type families terminate (Section 5). We do that 
by establishing a consistency property of the type equations 
induced by type families. 

• We identify the complications for consistency that arise from 
non-terminating type families and we expose a subtle oversight 
in GHC’s current rules for open type families in Section 6. 

• We have implemented closed type families in GHC as well as a 
number of case studies, such as the units package, an extensible 
framework for dimensional analysis, presented in Appendix A. 
Closed type families are available now in GHC 7.8. 
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In short, the programmer sees a simple, intuitive language fea¬ 
ture, but the design space (and its metatheory) is subtle. Although 
type families resemble the type-level computation and “large elim¬ 
inations” found in full-spectrum dependently-typed languages like 
Coq and Agda, there are important semantic and practical differ¬ 
ences. We discuss these in Section 8. 

2. Closed type families 

Haskell (in its implementation in GHC) has supported type families 
for several years. They were introduced to support associated types, 
a feature that Garcia et al.’s (2003) comparison between C++, 
Haskell, and ML, noted as a C++’s main superiority for generic 
programming. 

Type families were designed to dovetail smoothly with type 
classes. For example, the type function 2 Elt above could be used 
to specify the element type in a container class: 

class Container c where 
empty :: c 

member :: Elt c -4 c —t Bool 
instance Container [a] where ... 
instance Container ByteString where ... 

New instances for Container can be defined as new types are 
introduced, often in different modules, and correspondingly new 
equations for Elt must be added too. Hence Elt must be open 
(that is, can be extended in modules that import it), and distributed 
(can be scattered over many different modules). This contrasts with 
term-level functions where we are required to define the function 
all in one place. 

The open, distributed nature of type families, typically associ¬ 
ated with classes, requires strong restrictions on overlap to maintain 
soundness. Consider 

type family Fab::* 

type instance F Int a = Bool 

type instance F a Bool = Char 

Now consider the type (F Int Bool). Using the first equation, this 
type is equal to Bool, but using the second it is equal to Char. So 
if we are not careful, we could pass a Bool to a function expecting 
a Char, which would be embarrassing. 

GHC therefore brutally insists that the left-hand sides of two 
type instance equations must not overlap (unify). (At least, unless 
the right-hand sides would then coincide; see Section 3.4.) 

2.1 Closed families: the basic idea 

As we saw in the Introduction, disallowing overlap means that use¬ 
ful, well-defined type-level functions, such as type level equality, 
cannot be expressed. Since openness is the root of the overlap prob¬ 
lem, it can be solved by defining the equations for the type family 
all in one place. We call this a closed type family and define it using 
a where clause on the function’s original declaration. The equa¬ 
tions may overlap, and are matched top-to-bottom. For example: 

type family And (a :: Bool) (b :: Bool) :: Bool where 
And True True = True 
And a b = False 

Since the domain of And is closed and finite, it is natural to write 
all its equations in one place. Doing so directly expresses the fact 
that no further equations are expected. 

Although we have used overlap in this example, one can always 
write functions over finite domains without overlap: 


2 We use “type family” and “type function” interchangeably. 


type family And' (a :: Bool) (b :: Bool) :: Bool where 
And' True True = True 
And’ False True = False 
And' True False = False 
And' False False = False 

Nevertheless, overlap is convenient for the programmer, mirrors 
what happens at the term level, avoids a polynomial blowup in 
program size, and is more efficient (for the type checker) to execute. 
Furthermore, when defined over an open kind, such as *, closed 
type families allow a programmer to express relationships (such 
as inequality of types—see Section 2.4) that are otherwise out of 

2.2 Non-linear patterns 

Let us return to our equality function, which can now be defined 
thus: 

type family Equal (a :: *) ( b :: *) :: Bool where 
Equal a a = True 
Equal a b = False 

This declaration introduces the type function Equal, gives its kind 
and, in the where clause, specifies all its equations. The first equa¬ 
tion has a non-linear pattern, in which a is repeated, and it overlaps 
with the second equation. If the domain were finite we could avoid 
both features by writing out all the equations exhaustively, but new 
types can be introduced at any time, so we cannot do that here. 
The issue becomes even clearer when we use kind polymorphism 
(Yorgey et al. 2012), thus: 

type family Equal (a :: k) ( b :: n) :: Bool where 
Equal a a = True 
Equal a b = False 

For example, ( Equal Maybe List) should evaluate to False. It may 
seem unusual to define a function to compute equality even over 
types of function kind (*—►*). After all, there is no construct that 
can compare functions at the term level. 

At the type level, however, the type checker decides equality 
at function kinds all the time! In the world of Haskell types there 
exist no anonymous type-level functions, nor can type families 
appear partially applied, so this equality test—which checks for 
definitional equality, in type theory jargon—is straightforward. All 
Equal does is reify the (non-extensional) equality test of the type 
checker. 

In fact, Haskell programmers are used to this kind of equality 
matching on types; for example, even in Haskell 98 one can write 

instance Num a => Num (T a a) where ... 

Because the type inference engine already supports decidable 
equality, it is very straightforward to implement non-linear pat¬ 
terns for type functions as well as type classes. Non-linear patterns 
are convenient for the programmer, expected by Haskell users, and 
add useful expressiveness. They do make the metatheory much 
harder, as we shall see, but that is a problem that has to be solved 
only once. 

2.3 Type structure matching 

In our experience, most cases where closed type families with over¬ 
lapping equations are useful involve a variation on type equality. 
However, sometimes we would like to determine whether a type 
matches a specific top-level structure. 

For example, we might want to look at a function type of the 
form Int —» ( Bool —» Char) —¥ Int —» Bool and determine that 
this is a function of three arguments. 
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data Nat = Zero \ Succ Nat 
type family CountArgs (f :: *) :: Nat where 
CountArgs (a —> b) = Succ (CountArgs b) 

CountArgs result = Zero 

Because the equations are tried in order, any function type will 
trigger the first equation and any ground non-function type (that 
is, a type that is not a type variable or an arrow type) will trigger 
the second. Thus, the type family effectively counts the number of 
parameters a function requires. 

When might this be useful? We have used this type family to 
write a variable-arity zipWith function that infers the correct ar- 
ity, assuming that the result type is not a function type. Other 
approaches that we are aware of (Fridlender and Indrika 2000; 
McBride 2002; Weirich and Casinghino 2010) require some encod¬ 
ing of the desired arity to be passed explicitly. A full presentation of 
the variable-arity zipWith is presented in Appendix B. To achieve 
the same functionality in a typical dependently typed language like 
Agda or Coq, we must pattern-match over some inductive universe 
of codes that can be interpreted into types. 


2.4 Observing inequality 

Type families such as Equal allow programmers to observe when 
types do not match. In other words. Equal Int Bool automatically 
reduces to False , via the second equation. With open type families, 
we could only add a finite number of reductions of un-equal types 
to False. 

However, the ability to observe inequality is extremely use¬ 
ful for expressing failure in compile-time search algorithms. This 
search could be a simple linear search, such as finding an element 
in a list. Such search underlies the HList library and its encod¬ 
ing of heterogeneous lists and extensible records (Kiselyov et al. 
2004). It also supports Swierstra’s solution to the expression prob¬ 
lem via extensible datatypes (Swierstra 2008). Both of these pro¬ 
posals use the extension -XOverlappinglnstances to implement 
a compile-time equality function. 3 

Type families can directly encode more sophisticated search al¬ 
gorithms than linear list searching, including those requiring back¬ 
tracking, simply by writing a functional program. For example, the 
following closed type family determines whether a given element 
is present in a tree. 

data Tree a = Leaf a \ Branch (Tree a) (Tree a) 

type family TMember (e :: k ) ( set:: Tree k ) :: Bool where 
TMember e (Leaf x) = Equal e x 

TMember e (Branch left right) = 

Or (TMember e left) (TMember e right) 

Implementing this search using overlapping type classes, which 
do not support backtracking, requires an intricate encoding with 
explicit stack manipulation. 


2.5 Summary 

Type-level computation is a powerful idea: it allows a programmer 
to express application-specific compile-time reasoning in the type 
system. Closed type families fill in a missing piece in the design 
space, making type families more expressive, convenient, and more 
uniform with term-level functional programming. 


p Type patterns (no type families) 

F Type families 

Li Substitutions from type variables to types 

Figure 1. Grammar of Haskell metavariables 


3. Simplifying closed family applications 

We have shown in the previous sections how type family reduc¬ 
tion can be used to equate types. For example, a function requir¬ 
ing an argument of type T True can take an argument of type 
T (And True True), because the latter reduces to the former. 

Because the definition of type equality is determined by type 
family reduction, the static semantics must precisely define what 
reductions are allowed to occur. That definition turns out to be 
quite subtle, so this section develops an increasingly refined notion 
of type family reduction, motivated by a series of examples. The 
presentation gives a number of definitions, using the vocabulary 
of Figure 1, but we eschew full formality until Section 4. We use 
the term “target” to designate the type-function application that 
we are trying to simplify. We say that a type n “simplifies” or 
“reduces” to another type T2 if we can rewrite the n to ti using a 
(potentially empty) sequence of left-to-right applications of type 
family equations. We also use the notation n T2 to denote 
exactly one application of a type family equation and n T2 to 
denote an arbitrary number of reductions. Type equality is defined 
to be roughly the reflexive, symmetric, transitive, congruent closure 
of type reduction; details are in Section 4.3. 

We frequently refer to the example in the introduction, repeated 
below, with the variables renamed to aid in understanding: 

type family Equal (a :: n) ( b :: k) :: Bool where 
Equal a a = True — Eqn (A) 

Equal b c = False — Eqn (B) 

3.1 No functions on the LHS 

If we wish to simplify Equal Int Int, equation (A) of the definition 
matches, so we can safely “fire” the equation (A) to simplify the 
application to True. 

Even here we must take a little care. What happens if try this? 

type family F {aw Bool) where 
F False = False 
F True = True 

F (Equal x ypM}True 

Then F ( Equal Int Bool) superficially appears to match only the 
third equation. But of course, if we simplify the argument of F 
in the target, it would become F False, which matches the first 
equation. 

The solution here is quite standard; in type family definitions 
(both open and closed) we do not allow functions in the argument 
types on the LHS. In terms of Figure 1, the LHS of a function 
axiom must be a pattern p. This is directly analogous to allowing 
only constructor patterns in term-level function definitions, and is 
already required for Haskell’s existing open type families. 

We then propose the following first attempt at a reduction strat¬ 
egy: 


3 This extension allows class instances, but not type family instances, to 
overlap. If the type inference engine chooses the wrong class instance, a 
program may have incoherent behavior, but it is believed that type safety is 
not compromised. See Morris and Jones (2010) for relevant discussion. 


2013/11/15 



Candidate Rule 1 (Closed type family simplification). An equa¬ 
tion for a closed type family F can be used to simplify a target 
(F t) if (a) the target matches the LHS of the equation, and (b) no 
LHS of an earlier equation for F matches the target. 

The formal definition of matching follows: 

Definition 1 (Matching). A pattern p matches a type t, written 
match (p, t), when there is a well-kinded substitution El such that 
El(p) = r. The domain of O must be a subset of the set of free 
variables of the pattern p. 

3.2 Avoiding premature matches with apartness 

Suppose we want to simplify Equal Bool d. Equation (A) above 
fails to match, but (B) matches with a substitution El = [b i->- 
Bool, ci-td]. But it would be a mistake to simplify Equal Bool d 
to False. Consider the following code: 

type family Funlf (b :: Bool) :: * where 
Funlf True = Int —> Int 
Funlf False = () 

bad :: d —> Funlf (Equal Bool d) 
bad _ : () 
segFault:: Int 
segFault = bad True 5 

If we do simplify the type Equal Bool d to False then we can 
show that bad is well typed, since Funlf False is (). But then 
segFault calls bad with d instantiated to Bool. So segFault ex¬ 
pects bad True to return a result of type Funlf (Equal Bool Bool), 
which reduces to Int —> Int, so the call in segFault type-checks 
too. Result: we apply () as a function to 5, and crash. 

The error, of course, is that we wrongly simplified the type 
( Equal Bool d) to False ; wrongly because the choice of which 
equation to match depends on how d is instantiated. While the 
target ( Equal Bool d) does not match the earlier equation, there 
is a substitution for d that causes it to match the earlier equation. 
Our Candidate Rule 1 is insufficient to ensure type soundness. We 
need a stronger notion of apartness between a (target) type and a 
pattern, which we write as apart(p, t) in what follows. 

Candidate Rule 2 (Closed type family simplification). An equa¬ 
tion for a closed type family F can be used to simplify a target 
(F t) if (a) the target matches the LHS of the equation, and (b) 
every LHS p of an earlier equation for F is apart from the target; 
that is, apart(p,r). 

As a notational convention, apart(p, f) considers the lists p and 
r as tuples of types; the apartness check does not go element- 
by-element. We similarly treat uses of match and unify (defined 
shortly) when applied to lists. 

To rule out our counterexample to type soundness, apartness 
must at the very least satisfy the following property: 

Property 2 (Apartness through substitution). //'apart(p, t) then 
there exists no El such that match(p, $2 (t)). 

An appealing implementation of apart(p, t) that satisfies Prop¬ 
erty 2 is to check that the target r and the pattern p are not unifiable, 
under the following definition: 

Definition 3 (Unification). A type t\ unifies with a type T2 when 
there is a well-kinded substitution El such that fl(n) = El(rf). 
We write unify(ri, 72) = El for the most general such unifier if it 


4 For instance, the implementation of unify can be the standard first-order 
unification algorithm of Robinson. 


However this test is not sufficient for type soundness. Consider 
the type Equal Int (G Bool), where G is a type family. This type 
does not match equation (A), nor does it unify with (A), but it does 
match (B). So according to our rule, we can use (B) to simplify 
Equal Int (G Bool) to False. But, if G were a type function with 
equation 

type instance G Bool = Int 

then we could use this equation to rewrite the type to Equal Int Int, 
which patently does match (A) and simplifies to True! 

In our check of previous equations of a closed family, we wish 
to ensure that no previous equation can ever apply to a given ap¬ 
plication. Simply checking for unification of a previous pattern and 
the target is not enough. To rule out this counterexample we need 
yet another property from the apart(p, r) check, which ensures that 
the target cannot match a pattern of an earlier equation through ar¬ 
bitrary reduction too. 

Property 4 (Apartness through reduction), //"apartfp, r), then for 
any t' such that t t' : -imatch(p, t'). 

3.3 A definition of apartness 

We have so far sketched necessary properties that the apartness 
check must satisfy—otherwise, our type system surely is not sound. 
We have also described why a simple unification-based test does 
not meet these conditions, but we have not yet given a concrete 
definition of this check. 

Note that we cannot use Property 4 to define apart(p, t) be¬ 
cause it would not be well founded. We need apart(p, r) to de¬ 
fine how type families should reduce, but Property 4 itself refers to 
type family reduction. Furthermore, even if this were acceptable, 
it seems hard to implement. We have to ensure that, for any sub¬ 
stitution, no reducts of a target can possibly match a pattern; there 
can be exponentially many reducts in the size of the type and the 
substitution. 

Hence we seek a conservative but cheap test. Let us consider 
again why unification is not sufficient. In the example from the 
previous section, we showed that type Equal Int (G Bool) does 
not match equation (A), nor does it unify with (A). However, 
Equal Int (G Bool) can simplify to Equal Int Int and now 
equation (A) does match the reduct. 

To take the behavior of type families into account, we first 
flatten any type family applications in the arguments of the target 
(i.e., the types r in a target Ft) to fresh variables. Only then do 
we check that the new target is not unifiable with the pattern. This 
captures the notion that a type family can potentially reduce to any 
type—anything more refined would require advance knowledge of 
all type families, impossible in a modular system. In our example, 
we must check apart ((a, a), (Int, G Bool)) when trying to use the 
second equation of Equal to simplify Equal Int (G Bool). We 
first flatten (Int, G Bool) into (Int, x) (for some fresh variable x). 
Then we check whether (a, a) cannot be unified with (Int, x). We 
quickly discover that these types can be unified. Thus, (a, a) and 
(Int, G Bool) are not apart and simplifying Equal Int (G Bool) 
to False is prohibited. 

What if two type family applications in the target type are 
syntactically identical? Consider the type family F below: 

type family Fab where 

F Int Bool = Char 
F a a = Bool 

Should the type F (G Int) (G Int) be apart from the left-hand- 
side F Int Bool ? If we flatten to two distinct type variables then 
it is not apart; if we flatten using a common type variable then it 
becomes apart. How can we choose if flattening should preserve 
sharing or not? Let us consider the type F b b, which matches 


2013/11/15 



the second equation. It is definitely apart from F Int Bool and 
can indeed be simplified by the second equation. What happens, 
though, if we substitute G Int for b in F b b7 If flattening did not 
take sharing into account, ( G Int, G Int) would not be apart from 
(Int, Bool), and F ( G Int) (G Int ) wouldn’t reduce. Hence, the 
ability to simplify would not be stable under substitution. This, in 
turn, threatens the preservation theorem. 

Thus, we must identify repeated type family applications and 
flatten these to the same variable. In this way, F (G Int) (G Int) 
is flattened to F x x (never F x y), will be apart from the first 
equation, and will be able to simplify to Bool, as desired. 

With these considerations in mind, we can now give our imple¬ 
mentation of the apartness check: 

Definition 5 (Flattening). To flatten a type r into t', written t' = 
flatten (r), process the type t in a top-down fashion, replacing 
every type family application with a type variable. Two or more 
syntactically identical type family applications are flattened to the 
same variable; distinct type family applications are flattened to 
distinct fresh variables. 

Definition 6 (Apartness). To testfor apart(p, t), letr = flatten(r) 
and check unify(p, r'). If this unification fails, then p and t are 
apart. More succinctly: apart (p, r) = ->unify(p, flatten(r)). 

We can show that this definition does indeed satisfy the identi¬ 
fied necessary properties from Section 3.2. In Section 5.1 we will 
also identify the sufficient conditions for type soundness for any 
possible type-safe implementation of apartness, show that these 
conditions imply the properties identified in the previous section 
(a useful sanity check!) and prove that the definition of apartness 
that we just proposed meets these sufficient conditions. 

3.4 Allowing more reductions with compatibility 

Checking for apartness in previous equations might be unneces¬ 
sarily restrictive. Consider this code, which uses the function And 
from Section 2.1: 

f ::T a —¥ T b —¥ T (And a b) 
tt \\T True 
g ■. ■. T a —r T a 
g x= f x tt 

Will the definition of g type-check? Alas no: the call (f x tt) 
returns a result of type T (And a True), and that matches neither 
of the equations for And. Perhaps we can fix this by adding an 
equation to the definition of And, thus: 

type family And (a :: Bool) (b :: Bool) :: Bool where 

And True True = True — (1) 

And a True = a — (2) 

And a b = False — (3) 

But that does not work either: the target (And a True) matches (2) 
but is not apart from (1), so (2) cannot fire. And yet we would like 
to be able to simplify (And a True) to a, as Eqn (2) suggests. Why 
should this be sound? Because anything that matches both (1) and 
(2) will reduce to True using either equation. We say that the two 
equations coincide on these arguments. When such a coincidence 
happens, the apartness check is not needed. 

We can easily formalize this intuition. Let us say that two 
equations are compatible when any type that matches both left- 
hand sides would be rewritten by both equations to the same result, 
eliminating non-convergent critical pairs in the induced rewriting 
system: 

Definition 7 (Compatibility). Two type-family equations p and 
q are compatible iff fli(lhs p ) = Fl2(lhs q ) implies $?i (rhs p ) = 
Fl2(rhs q ). 


For example, (1) and (2) are compatible because a type, such as 
And True True, would be rewritten by both to the same type, 
namely True. It is easy to test for compatibility: 

Definition 8 (Compatibility implementation). The test for compat¬ 
ibility, written compat(p, q), checks that unify (lhs p ,lhs q ) = O im¬ 
plies Q(rhs p ) = Sl(rhs q ). If unify(lhs p ,lhs q ) fails, compat(p, q) 
holds vacuously. 

The proof that compatfp, q) implies that p and q are compatible 
appears in Appendix G and is straightforward. We can now state 
our final simplification rule for closed type families: 

Rule 9 (Closed type family simplification). An equation q of a 
closed type family can be used to simplify a target application F t 
if the following conditions hold: 

1. The target t matches the type pattern lhs q . 

2. For each earlier equation p, either compat(p, q) or 
apart (lhs p ,r). 

For example, we can fire equation (2) on a target that is not apart 
from (1), because (1) and (2) are compatible. We show that Rule 9 
is sufficient for establishing type soundness in Section 5. 

Through this use of compatibility, we allow for a limited form 
of theorem proving within a closed type family definition. The fact 
that equation (2) is compatible with (1) essentially means that the 
rewrite rule for (2) is admissible given that for (1). By being able 
to write such equations in the closed type family definition, we can 
expand Haskell’s definitional equality to relate more types. 

3.5 Optimized matching 

In our original Candidate Rule 2 above, when simplifying a target 
Ft with an equation q, we are obliged to check apart(7/?.s p , r), 
for every earlier equation p. But much of this checking is wasted 
duplication. For example, consider 
type family F a where 
F Int = Char - (1) 

F Bool = Bool — (2) 

F x = Int - (3) 

If a target matches (2) there is really no point in checking its 
apartness from (1), because anything that matches (2) will be apart 
from (1). We need only check that the target is apart from any 
preceding equations that could possibly match the same target. 

Happily, this intuition is already embodied in our new simplifi¬ 
cation Rule 9. This rule checks compatfp, q) V apart(//).y p , t) for 
each preceding equation p. But we can precompute compatfp, q) 
(since it is independent of the target), and in the simplification rule 
we need check apartness only for the pre-computed list of earlier 
incompatible equations. In our example, equations (1) and (2) are 
vacuously compatible, since their left-hand sides do not unify, and 
hence no type can match both. Thus, there is no need to check for 
apartness from (1) of a target matching (2). 

3.6 Compatibility for open families 

As discussed in the introduction, type instance declarations for 
open type families must not overlap. With our definition of com¬ 
patibility, however, we can treat open and closed families more 
uniformly by insisting that any two instances of the same open type 
family are compatible: 

Definition 10 (Open type family overlap check). Every pair 
of equations p and q for an open type family F must satisfy 
compat(p, q). 

Notice that this definition also allows for coincident right-hand 
sides (as in the case for closed type families. Section 3.4). For 
example, these declarations are legal: 
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type family Coincide a b 
type instance Coincide Int b = Int 
type instance Coincide a Bool = a 
These equations overlap, but in the region of overlap they always 
produce the same result, and so they should be allowed. (GHC 
already allowed this prior to our extensions.) 

3.7 Type inference for closed type families 

Given the difficulty of type inference for open type families (Chak- 
ravarty et al. 2005; Schrijvers et al. 2008), how do we deal with 
closed ones? Thankfully, this turns out to be remarkably easy: we 
simply use Rule 9 to simplify closed families in exactly the same 
stage of type inference that we would simplify an open one. The 
implementation in GHC is accordingly quite straightforward. 

Despite the ease of implementation, there are perhaps complex 
new possibilities opened by the use of closed families—these are 
explored in Section 7.6. 

4. System |uFC: formalizing the problem 

Thus far we have argued informally. In this section we formalize 
our design and show that it satisfies the usual desirable properties 
of type preservation and progress, assuming termination of type 
family reduction. It is too hard to formulate these proofs for all 
of Haskell, so instead we formalize pFC, a small, explicitly-typed 
lambda calculus. This is more than a theoretical exercise: GHC 
really does elaborate all of Haskell into System FC (Sulzmann et al. 
2007a; Weirich et al. 2013), of which pFC is a large subset that 
omits some details of FC—such as kind polymorphism (Yorgey 
et al. 2012)—that are irrelevant here. 

4.1 System pFC 

System pFC is an extension of System F, including kinds and 
explicit equality coercions. Its syntax is presented in Figure 2. This 
syntax is very similar to recent treatments of System FC (Weirich 
et al. 2013). We omit from the presentation the choice of ground 
types and their constructors and destructors, as they are irrelevant 
for our purposes. 

There are a few points to note about type families, all visible in 
Figure 2. A type family has a particular arity, and always appears 
saturated in types. That explains the first-order notation F(k):k' in 
ground contexts E, and F(t) in types. 

A closed type family appears in pFC as a kind signature 
F(k):k ', and a single axiom C:F, both in the top-level ground 
context E. The “type” 'P of the axiom is a list of equations, each 
of form [aTTc]. F(r) ~ o, just as we have seen before except that 
the quantification is explicit. For example, the axiom for Equal 
(restricted for simplicity to kind *) looks like this: 

axiomEq : [a :*].(Equal a a) ~ True ; 

[a:*,/3:*].(Equal a1 3) ~ False 

Although our notation for lists does not make it apparent, we 
restrict the form of the equations to require that F refers to only 
one type family—that is, there are no independent F,. We use 
subscripts on metavariables to denote which equation they refer to, 
and we refer to the types pi as the type patterns of the i’th equation. 
We assume that the variables a bound in each equation are distinct 
from the variables bound in other equations. 

An open type family appears as a kind signature and zero or 
more separate axioms, each with one equation. 

4.2 Static semantics 

Typing in pFC is given by the judgments in Figure 3. Most of the 
rules are uninteresting and are thus presented in Appendix C. The 
typing rules for expressions are entirely straightforward. The only 


Expressions: 

e ::= x \ Xx:r.e | ei ei \ Acc.n.e \ er 
| e > 7 Cast 

| ... Constructors and destruc¬ 

tors of datatypes 

Types: 

t,o, ::= a | n -A r 2 | V or.n.T 

tj),v | n r 2 Application 

| F(t) Saturated type family 

| H Datatype, such as Int 


p denotes a type pattern (with no type families) 
k ::= * | Ki —» «2 Kinds 


Propositions: 

<j> n ~ 7*2 Equality propositions 

$ ::= §5ck]. F(p) ~ cr Axiom equations 

'P ■;=; $ List of axiom eqns. (axiom types) 


Coercions: 
7,r? ::= 


71 -A 72 

| V a-.K. 7 1 71 72 | F( 7) 

<T> 

Reflexivity 

sym7 

Symmetry 

7 i 972 

Transitivity 

left 7 

Left decomposition 

right 7 

Right decomposition 

C[i\T 

Axiom application 


Contexts: 

Ground: E 

Variables: A 

Combined: T 

Substitutions: Q 


■ | E, H:k -a * | E, F(k):k' | E, C:V 
• | A ,x:t | A ,or.K 
E; A 


Figure 2. The grammar of System pFC 


r htm e : r 

Expression typing 

r hty t : n 

Type kinding 

r h co 7 : 0 

Coercion typing 

h-gnd E 

Ground context validity 

E h> ar A 

Variables context validity 

bet x r 

Context validity 


Figure 3. Typing judgments for System pFC 


noteworthy rule is the one for casting, which gives the raison d’etre 
for coercions: 


r hco 7 : Ti ~ 7*2 r htm e : n 
r htm e > 7 : 7*2 


Tm.Cast 


Here, we see that a cast by a coercion changes the type of an ex¬ 
pression. This is what we mean by saying that a coercion witnesses 
the equality of two types—if there is a coercion between n and 7*2, 
then any expression of type n can he cast into one of type 7*2. 

The rules for deriving the kind of a type are straightforward and 
are omitted from this presentation. 


4.3 Coercions and axiom application 

Coercions are less familiar, so we present the coercion typing rules 
in full, in Figure 4. The first four rules say that equality is congru¬ 
ent —that is, types can be considered equal when they are formed of 
components that are considered equal. The following three rules as¬ 
sert that coercibility is a proper equivalence relation. The Co Left 
and Co_Right rules assert that we can decompose complex equal- 
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I r h ‘° 7 : * 


Coercion typing 


r ieo 71 : T 1 ~ ft r l-Jo 72 : r 2 ~ T 2 

r hty n -» r 2 : * _ 

r h co 71 —> 72 : (ti -A r 2 ) ~ (rj -A r^) 

T, q:k hto 7 : n ~ T2 r hty V a:K.Ti : * 

T h co Vo:k .7 : (Vq:k.ti) ~ (Vq:k.t 2 ) 


Co_Arrow 


Co_Forall 


n- C 0 7 i:ri~ ffi r hjo 72 : t 2 ~ cr 2 

T hty n r 2 : k _ 

T h co 71 72 : (ri r 2 ) ~ (or ct 2 ) 


Co_App 


ities to simpler ones. These formation rules are incomplete with 
respect to some unspecified notion of semantic equality—that is, 
we can imagine writing down two types that we “know” are equal, 
but for which no coercion is derivable. For example, there is no 
way to use induction over a data structure to prove equality. How¬ 
ever, recall that these coercions must all be inferred from a source 
program, and it is unclear how we would reliably infer inductive 
coercions. 

The last rule of coercion formation, Co AXIOM, is the one 
that we are most interested in. The coercion C[i\ t witnesses the 
equality obtained by instantiating the i’th equation of axiom C with 
the types r. For example, 

axiomEq[ 0] Int : Equal Int Int ~ True 


r hc 0 7 : Ti ~ • 
r b ty F(t r) : k 
r b co F(7) : F(t r) - 


r hty t : K 
r hco (r) : r ~ r 
r h co 7 : n ~ T2 
r fco sym 7 : r 2 ~ n 

r u 71 : n ~ t 2 r hco 72 


nni 

— Co_Refl 


T h co 7i 9 72 : 
T h co 7 : n t 2 * 


r 3 


T ho left 7 : n ~ <Ti 
r ho 7 : n r 2 ~ ffi cr 2 

r hy T2 : K r hy cr 2 : K 

r ho right 7 : r 2 ~ cr 2 


C:* £ S * = » HP) ^ v 

E; A h ty r : /ti h ctx E; A 
Vjf < i, no_conflict('I/, i,T,j) 

- ^Co_A> 

£; A ho G[i\ t : F(p t {r/^l) ~ Vi { T / ai \ 

no_conflict(^ f ,i,r,j) | Check for equation conflicts 

^ = [ATTt], F{p) ~ v apart (ft, Pi[r/aj]) 


no_conflict(^t, i,T,j) 
compat(tF[«], ^[7]) 
no_conflict('F, i,r,y) 


NC.Compatible 


| compat($i, $ 2 ) | Equation compatibility 


$1 = teplj. F(pi) ~ ■ 
$2 = [a2TK2]. F{fi2) ~ ■ 
unify(pT, pii) = 

fi(ui) = fi(v 2 ) 

compat($i, $ 2 ) 

$1 = ferRtJ. F(f 1} ^ 
$2 = [chTKh]. F(p^) r' 
unify(pT, fails 


COMPAT_COINCIDENT 


Compat_Distinct 


Figure 4. Coercion formation rules 


This says that if we pick the first equation of axiomEq (we in¬ 
dex from 0), and instantiate it at Int, we have a witness for 
Equal Int Int ~ True. 

Notice that the coercion C\i\ T specifies exactly which equation 
is picked (the i’th one); pFC is a fully-explicit language. However, 
the typing rules for pFC must reject unsound coercions like 
axiomEq[l] Int Int : Equal Int Int ~ False 
and that is expressed by rule Co Axiom. The premises of the rule 
check to ensure that E; A is a valid context and that all the types r 
are of appropriate kinds to be applied in the i’th equation. The last 
premise implements Rule 9 (Section 3.4), by checking no_conflict 
for each preceding equation j. The no conflict judgment simply 
checks that either (NC_Compatible) the i’th and j’th equation 
for C are compatible, or (NC -Apart) that the target is apart from 
the LHS of the j’th equation, just as in Rule 9. 

In NC Compatible, note that the com pat judgment does not 
take the types r: compatibility is a property of equations, and is 
independent of the specific arguments at an application site. The 
two rules for compat are exactly equivalent to Definition 8. 

These judgments refer to algorithms apart and unify. We as¬ 
sume a correct implementation of unify and propose sufficient 
properties of apart in Section 5.1. We then show that our chosen 
algorithm for apart (Definition 6) satisfies these properties. 

As a final note, the rules do not check the closed type family 
axioms for exhaustiveness. A type-family application that matches 
no axiom simply does not reduce. Adding an exhaustiveness check 
based on the kind of the arguments of the type family might be a 
useful, but orthogonal, feature. 

5. Metatheory 

A summary of the structure of the type safety proof, highlighting 
the parts that are considered in this paper, is in Figure 5. Our 
main goals are to prove (i) the substitution lemma of types into 
coercions (Section 5.2), and (ii) a consistency property that ensures 
we never equate two types such as Int and Bool (Section 5.3). The 
substitution and consistency lemmas lead to the preservation and 
progress theorems respectively, which together ensure type safety. 
We omit the operational semantics of pFC as well as the other 
lemmas in the main proofs of preservation and progress, because 
these are all direct adaptations from previous work (Weirich et al. 
2011; Sulzmann et al. 2007a). 

We stress that, as Figure 5 indicates, we have proved type safety 
only for terminating type families. What exactly does that mean? 
We formally define the rewrite relation, now written E b • • to 

explicit mention the set of axioms, with the following rule: 

C € E triSS ~ V _ 

b gnd E •//.,] 

V j < i, no-ConflictjT', i, tp, j) D 

EI-C[F(t)]-»C[t'] RLIJ 
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Coercion subst. lemma 
(§5.2) 


GoodS 

(§5.4) 


Confluence 


Term subst. lemma 


Consistency (§5.3) 

I 

Progress 


Figure 5. Structure of type safety proof. The arrows represent 
implications. The nodes highlighted in gray are the parts considered 
in the present work. 


In the conclusion of this rule, C\-\ denotes a type context with 
exactly one hole. Its use in the rule means that a type family 
can simplify anywhere within a type. Note that the no_conflict 
premise of this rule is identical to that of the Co Axiom rule. 
By “terminating type families” we mean that the E >'■ • • 

relation cannot have infinite chains. We discuss non-terminating 
type families in Section 6. 

As a notational convention, we extend the relation to lists of 
types by using E I- fj ■*+ Tz to mean that exactly one of the types 
in rT steps to the corresponding type in tz; in all other positions n 
and tz are identical. 

5.1 Preliminaries: properties of unification and apartness 

In order to prove properties about no_conflict, we must assume the 
correctness of the unification algorithm: 

Property 11 (unify correct). If there exists a substitution O such 
thatQ(o) = S2(t), then unifyja, t) succeeds. If unify(o,r) = f2 
then O is a most general unifier ofo and r. 

In Section 3.2, we gave some necessary properties of apart, 
namely Properties 2 and 4. To prove type soundness we need suf¬ 
ficient properties, such as the following three. Any implementation 
of apart that has these three properties would lead to type safety. 
We prove (in Appendix F) that the given algorithm for apart (Def¬ 
inition 6) satisfies these properties. Due to flattening in the defini¬ 
tion of apart, this proof is non-trivial. As a sanity check, we also 
prove that the sufficient properties imply the necessary ones of Sec- 


Property 14 (Apartness can be regained after reduction). If r = 
f2(p) and E h r t', then there exists a r" such that 

1. Eb r' t", 

2. 1^5= f V fp) for some Q', and 

3. for every p' such that apart(p', r): apart(p', r"). 

Here is an example of Property 14 in action. Consider the following 
type families F and G: 
type family F a where 

F ( Int, Bool) = Char - (A) 

F (a, a) = Bool - (B) 
type family G x where G Int = Double 
Suppose that our target is F (G Int, G Int), and that our partic¬ 
ular implementation of apart allows equation (B) to fire; that is, 
apart ((Int, Bool), ( G Int, G Int)). Now, suppose that instead of 
firing (B) we chose to reduce the first G Int argument to Double. 
The new target is now F ( Double, G Int). Now (B) cannot fire, 
because the new target simply does not match (B) any more. Prop¬ 
erty 14 ensures that there exist further reductions on the new target 
that make (B) Arable again—in this case, stepping the second G Int 
to Double does the job. Conditions (2) and (3) of Property 14 for¬ 
malize the notion “make (B) Arable again”. 

5.2 Type substitution in coercions 

System pFC enjoys a standard term substitution lemma. This 
lemma is required to prove the preservation theorem. As shown 
in Figure 5, the term substitution lemma depends on the substi¬ 
tution lemma for coercions. We consider only the case of interest 
here, that of substitution in the rule CO-Axiom. 

Lemma 15 (Co_Axiom Substitution). IfE; A, ,3:k, A' bo C[i T 
F(pi\rjai]) ~ v ,[T/af\and E; A by o : K, then E; A, A '[<j/0\ b co 
C[t] ■■ F{ P% iFJ^][o/p]) ~ V i [^][<r/I3]. 

The proof of this lemma, presented in Appendix D, proceeds 
by case analysis on the no_conflict judgment. It requires the use 
of the (standard) type substitution lemma and Property 12, but is 
otherwise unremarkable. 

5.3 Consistency 

As discussed at the beginning of this section, to establish progress 
we must show consistency. Consistency ensures that we can never 
deduce equalities between distinct value types, denoted with £: 

£ :m.. H t I T1 -A Tz I V cck.t 

For example, Int, Bool, and V a:*.a —t a are all value types. A set 
of axioms is consistent if we cannot deduce bogus equalities like 
Int ~ Bool or Int <v V a:*.a —t a: 

Definition 16 (Consistent contexts). A ground context E is consis¬ 
tent if, for all coercions 7 such that E; • bo 7 : £1 ~ £z.' 

2- if Si = Hri, then £2 = H tz, 

2. if£ 1 = n —¥ t[, then £2 = rz —» 72, and 

3. if £i:‘-®V cc.K.Ti, then £2 = V/?:/«.Tz. 


Property 12 (Apartness is stable under type substitution). If 
apart (p, r), then for all substitutions O, apart(p, I2 (t)). 

Property 13 (No unifiers for apart types), (/apartfp, r), then there 
exists no substitution D such that 12(p) = 12(r). 

The final property of the apartness check is the most complex. It 
ensures that, if an equation can fire for a given target and that target 
steps, then it is possible to simplify the reduct even further so that 
the same equation can fire on the final reduct. 


How can we check whether an axiom set is consistent? It is 
extremely hard to do so in general, so instead, following previous 
work (Weirich et al. 2011), we place syntactic restrictions on the 
axioms that conservatively guarantee consistency. A set of axioms 
that pass this check are said to be Good. We then prove the 
consistency lemma: 

Lemma 17 (Consistency). // Good E, then E is consistent. 

Following previous proofs, we show that if Good E and 
E; • bo 7 : <ti ~ 02, then <j\ and 02 have a common reduct 
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type instance A = C A 
type instance C x = D x (C x) 
type instance D x x = Int 

(1) A~> C A~* D A{C A) -w D (C A) (C A) Int 

(2) A-^ C A-^; ym C Int 

Int and C Int have no common reduct. 


<7 o (To (To 

•/ \* / \ / \ 


<71 (72 (71 (72 (Tl (72 



(a) Confluence (b) Local confluence (c) Local diamond 


Figure 6. Graphical representation of confluence properties. A 
solid line is a universally quantified input, and a dashed line is an 
existentially quantified output. 


in the relation. Because the simplification relation preserves 
type constructors on the heads of types, we may conclude that E is 
consistent. 

However, one of the cases in this argument is transitivity: the 
joinahility relation must be transitive. That is, if n and T2 have a 
common reduct a 1, and if T2 and T3 have a common reduct 02 , then 
n and T3 must have a common reduct (they are joinable). To show 
transitivity of joinahility, we must show confluence of the rewrite 
relation, in order to find the common reduct of <ti and 02 (which 
share T2 as an ancestor). 

Our approach to this problem is to show local confluence (see 
Figure 6) and then use Newman’s Lemma (1942) to get full con¬ 
fluence. Newman’s Lemma requires that the rewrite system is 
terminating—this is where the assumption of termination is used. 
The full, detailed proof appears in Appendix E. 

5.4 Good contexts 

What sort of checks should be in our syntactic conditions. Good? 
We would like Good to be a small set of common-sense conditions 
for a type reduction system, such as the following: 

Definition 18 (Good contexts). We have Good E whenever the 
following four conditions hold: 

1. For all C'.'h G E: fit is of the form [offc]. F(p) <v v where all 
of the Fi are the same type family F and all of the type patterns 
pi do not mention any type families. 

2. For all C:'F G E and equations [oTk]. F(p) ~ v in 4/: the 
variables a all appear free at least once in p. 

3. For all C:'F G E: if fit defines an axiom over a type family F 

and has multiple equations, then no other axiom G E 

defines an axiom over F. That is, all type families with ordered 
equations are closed. 

4. For all Gi:3>i G E and Ci'&t € E (each with only one 
equation), compat($i, $2). That is, among open type families, 
the patterns of distinct equations do not overlap. 

The clauses of the definition of Good are straightforward 
syntactic checks. In fact, these conditions are exactly what GHC 
checks for when compiling type family instances. This definition 
of Good leads to the proof of Lemma 39, as described above. 

6. Non-terminating type families 

By default GHC checks every type family for termination, to guar¬ 
antee that the type checker will never loop. Any such check is 
necessarily conservative; indeed, GHC rejects the TMember func¬ 
tion of Section 2.4 (Schrijvers et al. 2008). Although GHC’s test 
could readily be improved, any conservative check limits expres¬ 
siveness or convenience, so GHC allows the programmer to disable 


Figure 7. Counter-example to confluence 

the check. This may make the type checker loop, but it should not 
threaten soundness. 

However, the soundness result of Section 5 covers only termi¬ 
nating type families. Surprisingly (to us) non-termination really 
does lead to a soundness problem (Section 6.1). We propose a so¬ 
lution that (we believe) rules out this problem (Section 6.2), but 
explain why the main result of this paper is difficult to generalize 
to non-terminating type families, leaving an open problem for fur¬ 
ther work. 

6.1 The problem with infinity 

Consider this type family, adapted from Huet (1980): 
type family D x where 
D ([£>], £>) = Bool 
D (c, c) = Int 

We wish to simplify the target D (a, a). The type (a, a) matches 
the second pattern (c, c), but is it apart from the first pattern 
([b], b)2 Definition 6 asserts that they are apart since they do not 
unify: unification fails with an occurs check error. Accordingly, 
Rule 9 would simplify D (a, a) to Int. But consider the following 
definitions, where type family Loop is a nullary (0-argument) type 
family: 

type family Loop 

type instance Loop = [Loop] 

If we instantiate a with Loop we get (Loop, Loop) which can sim¬ 
plify to ([Loop], Loop). The latter does match the pattern ([b], b), 
violating Property 4, a necessary condition for soundness. 

So, in a non-terminating system our apartness check is unsound. 
Concretely, using our apartness implementation from Definition 6, 
we can equate types Int and Bool, thus: 

Int ~ D (Loop, Loop) ~ D ([Loop], Loop) ~ Bool 
Conclusion: we must not treat (a, a) as apart from the pattern 
([£>], b), even though they do not unify. In some ways this is not 
so surprising. In our earlier examples, apartness was based on 
an explicit contradiction (“a Bool cannot be an Int”), but here 
unification fails only because of an occurs check. As the Loop 
example shows, allowing non-terminating type-family definitions 
amounts to introducing infinite types, and if we were to allow 
infinite types, then (a, a) does unify with ([£>], b )! 

6.2 Fixing the problem 

The problem with the current apartness check is that finite unifica¬ 
tion fails too often. We need to replace the unification test in the 
definition of apartness with unification over infinite types: 
Definition 19 (Infinite unification). Two types n, T2 are infinitely 
unifiable, written unify 00 (ri, T2), if there exists a substitution uj 
whose range may include infinite types, such that ui(t 1) = u>(t 2). 

For example types (a, a) and ([b], b) are unifiable with a sub¬ 
stitution uj = [aH [[[...]]], b i-)- [[[...]]]]. Efficient algorithms 
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to decide unification over infinite types (and compute most gen¬ 
eral unifiers) have existed for some time and are based on well- 
established theory (Huet 1976; Courcelle 1983). See Jaffar (1984) 
for such an algorithm, and Knight (1989) for a general survey. 

We conjecture that replacing all uses of unify with unify^ 
in our definitions guarantees soundness, even in the presence of 
non-terminating family equations. Alas, this conjecture turns out 
to be very hard to prove, and touches on open problems in the 
term-rewriting literature. For example, a rewrite system that has 
(a) infinite rewrite sequences and (b) non-left-linear patterns, does 
not necessarily guarantee confluence, even if its patterns do not 
overlap. Figure 7 gives an example, from Klop (1993). 

Notice that replacing unify with unify^ may change the reduc¬ 
tion relation. For example, a target which is apart from a pattern 
with a unify-based apartness check may no longer be apart from the 
same pattern with the more conservative unify^-based apartness 
check. Yet, type safety (for terminating axiom sets) is not compro¬ 
mised since Property 11 carries over to unification algorithms over 
infinite types (Fluet 1976). 

6.3 Ramifications for open families 

We pause briefly to consider the implications for GHC’s existing 
open type families. GHC allows the following definition for an 
open type family D’\ 

type family D' x y 

type instance D' [b] b= Bool 

type instance D’ c c = Int 

As described in Section 2, the type instance equations of an open 
type family are required to have non-overlapping left-hand sides, 
and GHC 7.6 believes that the two equations do not overlap because 
they do not unify. But, using certain flags, GHC also accepts the 
definition of Loop, and the target (D’ Loop Loop) demonstrates 
that the combination is unsound precisely as described above. 5 

Happily, if the conjecture of Section 6.2 holds true, we can apply 
the same fix for open families as we did for closed families: simply 
use unify^ instead of unify when checking for overlap. Indeed, this 
is exactly how we have corrected this oversight in GHC 7.8. 

7. Discussion and Future Work 

The study of closed type families opens up a wide array of related 
issues. This section discusses some of the more interesting points 
we came across in our work. 

7.1 Denotational techniques for consistency 

We do not have a proof of consistency for a system with non¬ 
terminating, non-left-linear axioms (even when using unify^ in¬ 
stead of unify). We have seen that confluence is false, and hence 
cannot be used as a means to show consistency. 

A possible alternative approach to proving consistency—side¬ 
stepping confluence—is via a denotational semantics for types. We 
would have to show that if we can build a coercion 7 such that 
r h 7 : t ~ a, then Jr] = [<t] , for some interpretation of types 
into a semantic domain. The “obvious” domain for such a seman¬ 
tics, in the presence of non-terminating computations, is the domain 
that includes _L as well as finite and infinite trees. Typically in de¬ 
notational semantics, recursive type families would be interpreted 
as the limit of approximations of continuous functions. However, 
the “obvious” interpretation of type families in this simple domain 
is not monotone. Consider this type family: 


5 Akio Takano has posted an example of how this can cause a program to 
fail, at http: //ghc .haskell. org/trac/ghc/ticket/8162. 


type family Fab where 

F x x = Int 

F [x] ( Maybe x) = Char 

It is the case that (_L C [_L]) and (_L C Maybe _L), but the semantic 
interpretation of F, call it /, should satisfy /(_L, _L) = Int and 
/([_L], Maybe _L) = Char. Hence, monotonicity breaks. The lack 
of monotonicity means that limits of chains of approximations do 
not exist, and thus that interpretations of functions, such as /, are 
ill-defined. 

An alternate definition would give /(!., A) = -L, but then sub- 
stitutivity breaks. Indeed, the proof theory can deduce that F x x is 
equal to Int for any type x, even those that have denotation _L. 

Alternatively to these approaches, one might want to explore 
different domains to host the interpretation of types. 

7.2 Conservativity of apartness 

We note in Section 3.3 that our implementation of apartness is 
conservative. This conservativity is unavoidable—it is possible for 
open type families to have instances scattered across modules, 
and thus the apartness check cannot adequately simplify the types 
involved in every case. However, the current check considers none 
of the type family axioms available, even if one would inform the 
apartness check. For example, consider 
type family G a where 
G Int = Bool 
G [a] = Char 

and we wish to simplify target Equal Double (G b). It is clear 
that an application of G can never simplify to Double, so we could 
imagine a more refined apartness check that could reduce this target 
to False. We leave the details of such a check to future work. 

7.3 Conservativity of coincident overlap: partial knowledge 

It is worth noting that the compatibility check (Definition 8) is 
somewhat conservative. For example, take the type family 

type family Fab where 

F Bool c = Int 
F d e = e 

Consider a target F g Int. The target matches the second equation, 
but not the first. But, the simplification rule does not allow us to fire 
the second equation—the two equations are not compatible, and the 
target is not apart from the first equation. Yet it clearly would be 
safe to fire the second equation in this case, because even if g turns 
out to be Bool, the first equation would give the same result. 

It would, however, be easy to modify F to allow the desired 
simplification: just add a new second equation F a Int = Int. This 
new equation would be compatible with the first one and therefore 
would allow the simplification of F g Int. 

7.4 Conservativity of coincident overlap: requiring syntactic 
equality 

The compatibility check is conservative in a different dimension: it 
requires syntactic equality of the RHSs after substitution. Consider 
this tantalizing example: 


type family Plus a b where 


Plus Zero 

a = a 

- (A) 

Plus (Succ b) 

c = Succ (Plus b c) 

- (B) 

Plus d 

Zero = d 

- (C) 

Plus e 

(Succ f) = Succ (Plus e f) 

- (D) 


If this type family worked as one would naively expect, it would 
simplify an addition once either argument’s top-level constructor 
were known. (In other dependently typed languages, definitions 


2013/11/15 



like this are not possible and require auxiliary lemmas to reduce 
when the second argument’s structure only is known.) Alas, it does 
not work as well as we would hope. The problem is that not all 
the equations are compatible. Let’s look at (B) and (C). To check 
if these are compatible, we unify ((Succ b), c ) with (d, Zero) to 
get [c i—^ Zero, d i-A Succ b ]. The right-hand sides under this 
substitution are Succ ( Plus b Zero) and Succ b. However, these 
are not syntactically identical, so equations (B) and (C) are not 
compatible, and a target such as Plus g Zero is stuck. 

Why not just allow reduction in the RHSs before checking for 
compatibility? Because doing so is not obviously well-founded! 
Reducing the Succ ( Plus b Zero) type that occurred during the 
compatibility check above requires knowing that equations (B) and 
(C) are compatible, which is exactly what we’re trying to establish. 
So, we require syntactic equality to support compatibility, and leave 
the more general check for future work. 

7.5 Lack of inequality evidence 

One drawback of closed type families is that they sometimes do 
not compose well with generalized algebraic datatypes (GADTs). 
Consider the following sensible-looking example: 
data X a where 
XInt :: X Int 
XBool :: X Bool 
XChar :: X Char 
type family Collapse a where 
Collapse Int = Int 
Collapse x = Char 
collapse :: X a —> X (Collapse a) 
collapse Xlnt = Xlnt 
collapse _ ’=4 XChar 

The type function Collapse takes Int to itself and every other type 
to Char. Note the type of the term-level function collapse. Its im¬ 
plementation is to match Xlnt —the only constructor of X param¬ 
eterized by Int —and return Xlnt; all other constructors become 
XChar. The structure of collapse exactly mimics that of Collapse. 
Yet, this code does not compile. 

The problem is that the type system has no evidence that, in the 
second equation for collapse, the type variable a cannot be Int. So, 
when type-checking the right-hand side XChar, it is not type-safe 
to equate Collapse a with Char. The source of this problem is that 
the type system has no notion of inequality. If the case construct 
were enhanced to track inequality evidence and axiom application 
could consider such evidence, it is conceivable that the example 
above could be made to type-check. Such a notion of inequality has 
not yet been considered in depth, and we leave it as future work. 

7.6 Type inference 

The addition of closed type families to Haskell opens up new possi¬ 
bilities in type inference. By definition, the full behavior of a closed 
type family is known all at once. This closed-world assumption al¬ 
lows the type inference engine to perform more improvement on 
types than would otherwise be possible. Consider the following 
type family: 

type family Inj a where 
Inj Int = Bool 
Inj Bool = Char 
Inj Char = Double 

Type inference can discover in this case that Inj is indeed an 
injective type function. When trying to solve a constraint of the 
form Inj Int ~ Inj q the type inference engine can deduce that 
q must be equal to Int for the constraint to have a solution. By 


contrast, if Inj were not identified as injective, we would be left 
with an unsolved constraint as in principle there could be multiple 
other types for q that could satisfy Inj Int ~ Inj q. 

Along similar lines, we can imagine improving the connection 
between Equal and (~). Currently, if a proof a ~ b is available, 
type inference will replace all occurrences of a with b, after which 
Equal a b will reduce to True. However, the other direction does 
not work: if the inference engine knows Equal a b ~ True, it will 
not deduce a ~ b. Given the closed definition of Equal, though, 
it seems possible to enhance the inference engine to be able to go 
both ways. 

These deductions are not currently implemented, but remain as 
compelling future work. 

8. Related work 

8.1 Previous work on System FC 

The proof of type soundness presented in this paper depends heav¬ 
ily on previous work for System FC, first presented by Sulzmann 
et al. (2007a). That work proves consistency only for terminating 
type families, as we do here. 

In a non-terminating system, local confluence does not imply 
confluence. Therefore, previous work (Weirich et al. 2011) showed 
confluence of the rewrite system induced by the (potentially non¬ 
terminating) axiom set by establishing a local diamond property 
(see Figure 6). However, the proof took a shortcut: the require¬ 
ments for good contexts effectively limited all axioms to be left- 
linear. The local diamond proof relies on the fact that, in a system 
with linear patterns, matching is preserved under reduction. For in¬ 
stance, consider these axioms: 

type instance F a b = H a 

type instance G Int = Bool 

The type F (G Int) ( G Int) matches the equation for F and can 
potentially simplify to F (G Int) Bool or to F Bool (G Int) or 
even to F Bool Bool. But, in all cases the reduct also matches 
the very same pattern for F, allowing local diamond property to be 

What is necessary to support a local diamond property in a 
system with closed type families, still restricted to linear patterns? 
We need this property: If F r can reduce by some equation q, and 
t t' , then F r 1 can reduce by that same equation q. With only 
open families, this property means that matching must be preserved 
by reduction. With closed families, however, both matching and 
apartness must be preserved by reduction. Consider the definition 
for F' below (where H is some other type family): 

type family F' a b where 
F' Int Bool = Char 
F’ a b = H a 

We know that F' (G Int) (G Int) matches the second equation 
and is apart (Definition 6) from the first equation. The reduct 
F' (G Int) Bool also matches the second equation but is not apart 
from the first equation. Hence, F' (G Int) Bool cannot simplify 
by either equation for F', and the local diamond property does not 
hold. Put simply, our apartness implementation is not preserved by 
reduction. 

In a terminating system, we are able to get away with the weaker 
Property 14 for apart (where apartness is not directly preserved 
under reduction), which our implementation does satisfy. We have 
designed an implementation of apart which is provably stable 
under reduction, but it is more conservative and less intuitive for 
programmers. Given that this alternative definition of apart brought 


6 Actually, under parallel reduction; see (Weirich et al. 2011). 
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a proof of type safety only for potentially non-terminating but 
linear patterns (prohibiting our canonical example Equal), and 
that it often led to stuck targets where a reduction naively seemed 
possible, we have dismissed it as being impractical. We thus seek 
out a proof of type safety in the presence of non-terminating, non¬ 
left-linear axiom sets. 

8.2 Type families vs. functional dependencies 

Functional dependencies (Jones 2000) (further formalized by Sulz- 
mann et al. (2007b)) allow a programmer to specify a dependency 
between two or more parameters of a type class. For example, Kise¬ 
lyov et al. (2004) use this class for their type-level equality func- 

class HEq x y (b :: Bool) \ x y — > b 

instance HEq x x True 

instance ( b ~ False) => HEq x y b 

The annotation xy-) b in the class header declares a functional 
dependency from x and y to b. In other words, given x and y, we 
can always find b. 

Functional dependencies have no analogue in GHC’s internal 
language, System FC; indeed they predate it. Rather, functional de¬ 
pendencies simply add extra unification constraints that guide type 
inference. This can lead to very compact and convenient code, es¬ 
pecially when there are multiple class parameters and bi-directional 
functional dependencies. However, functional dependencies do not 
generate coercions witnessing the equality between two types. 
Hence they interact poorly with GADTs and, more generally, with 
local type equalities. For example, consider the following: 

class Same a b\ a —> b 
instance Same Int Int 
data T a where 
T1 :: T Int 
T2 -. -.T a 
data 5 a where 

MkS :: Same a b ^ b-t S a 
f :: T a —t S a -y Int 
f T1 (MkS b) = b 
f T2 s =3 

In the T1 branch of f we know that a is Int, and hence (via the 
functional dependency and the Same Int Int instance declaration) 
the existentially-quantified b must also be Int, and the definition 
should type-check. But GHC rejects f, because it cannot produce a 
well-typed FC term equivalent to it. Could we fix this, by producing 
evidence in System FC for functional dependencies? Yes; indeed, 
one can regard functional dependencies as a convenient syntactic 
sugar for a program using type families. For example we could 
translate the example like this: 

class F a ~ b => Same a b where 
type F a 

instance Same Int Int where 
type F Int = Int 

Now the (unchanged) definition of f type-checks. 

A stylistic difference is that functional dependencies and type 
classes encourage logic programming in the type system, whereas 
type families encourage functional programming. 


7 Available from http://0kmij.0rg/ftp/Haskell/types.html# 
HList. 


8.3 Controlling overlap 

Morris and Jones (2010) introduce instance chains, which obviate 
the need for overlapping instances by introducing a syntax for 
ordered overlap among instances. Their ideas are quite similar to 
the ones we present here, with a careful check to make sure that 
one instance is impossible before moving onto the next. However, 
the proof burden for their work is lower than ours—a flaw in 
instance selection may lead to incoherent behavior (e.g., different 
instances selected for the same code in different modules), but 
it cannot violate type safety. This is because class instances are 
compiled solely into term-level constructs (dictionaries), not type- 
level constructs. In particular, no equalities between different types 
are created as part of instance compilation. 

8.4 Full-spectrum dependently typed languages 

Type families resemble the type-level computation supported by 
dependently typed languages. Languages such as Coq (Coq devel¬ 
opment team 2004) and Agda (Norell 2007) allow ordinary func¬ 
tions to return types. As in Haskell, type equality in these languages 
is defined to include /3-reduction of function application and 1- 
reduction of pattern matching. 

However, there are several significant differences between these 
type-level functions and type families. The first is that Coq and 
Agda do not allow the elimination of their equivalents of kind *. 
There is no way to write a Coq/Agda function analogous to the 
closed type family below, which returns True for function types 
and False otherwise. 

type family IsArrow (a :: *) :: Bool where 
Is Arrow (a —» b) = True 
IsArrow a = False 

Instead, pattern matching is only available for inductive datatypes. 
The consistency of these languages prohibits the elimination of 
non-inductive types such as * (or Set, Prop, and Type). 

Furthermore, pattern matching in Coq and Agda does not sup¬ 
port non-linear patterns. As we discussed above, non-linear patterns 
allow computation to observe whether two types are equal. How¬ 
ever, the equational theory of full spectrum languages is much more 
expressive than that of Haskell. Because these languages allow un¬ 
saturated functions in types, it must define when two functions are 
equal. This comparison is intensional, and allowing computation 
to observe intensional equality is somewhat suspicious. However, 
in Haskell, where all type functions must always appear saturated, 
this issue does not arise. 

Due to the lack of non-linear patterns, Coq and Agda program¬ 
mers must define individual functions for every type that supports 
decidable equality. (Coq provides a tactic—decide equality— 
to automate this definition.) Furthermore, these definitions do not 
immediately imply that equality is reflexive; this result must be 
proved separately and manually applied. In contrast, the closed type 
family Equal a a immediately reduces to True. 

Similarly, functions in Coq and Agda do not support coincident 
overlap at definition time. Again, these identities can be proven as 
lemmas, but must be manually applied. 

8.5 Other functional programming languages 

Is our work on closed type families translatable to other func¬ 
tional programming languages with rich type-level programming? 
We think so. Though the presentation in this paper is tied closely 
to Haskell, we believe that the notion of apartness would be quite 
similar (if not the same) in another programming language. Ac¬ 
cordingly, the analysis of Section 3 would carry over without much 
change. The one caveat is that, as mentioned above, non-linear pat¬ 
tern matching depends on the saturation of all type-level functions. 
If this criterion is met, however, we believe that other languages 
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could adopt the surface syntax and behavior of closed type families 
as presented here without much change. 

9. Conclusions 

Closed type families improve the usability of type-level compu¬ 
tation, and make programming at the type level more reminis¬ 
cent of ordinary term-level programming. At the same time, closed 
families allow for the definition of manifestly-reflexive, decidable 
equality on types of any kind. They allow automatic reductions of 
types with free variables and allow the user to specify multiple, po¬ 
tentially overlapping but coherent reduction strategies (such as the 
equations for the And example). 

On the theoretical side, the question of consistency for non¬ 
terminating non-left-linear rewrite systems is an interesting re¬ 
search problem in its own right, quite independent of Haskell or 
type families, and we offer it as a challenge problem to the reader. 

Acknowledgments 

We particularly thank Conor McBride, Joxan Jaffar, and Stefan 
Kahrs for helping us navigate the literature on term rewriting, 
and on unification over infinite types. Stefan Kahrs provided the 
counter-example to confluence. Thanks also to Jose Pedro Mag- 
alhaes for detailed and helpful feedback on the paper. 

This material is partly supported by the National Science Foun¬ 
dation under Grant Nos. 1116620 and 1319880. 

References 

M. Chakravarty, G. Keller, and S. Peyton Jones. Associated type synonyms. 
In ACM SIGPLAN International Conference on Functional Program¬ 
ming (ICFP’05), Tallinn, Estonia, 2005. 

Coq development team. The Coq proof assistant reference manual. LogiCal 
Project, 2004. URL http: //coq. inria.fr. Version 8.0. 

B. Courcelle. Fundamental properties of infinite trees. Theoretical com¬ 
puter science, 25(2):95—169, 1983. 

D. Fridlender and M. Indrika. Functional pearl: Do we need dependent 
types? Journal of functional programming, 10(4):409-415, 2000. 

R. Garcia, J. Jarvi, A. Lumsdaine, J. G. Siek, and J. Willcock. A compar¬ 
ative study of language support for generic programming. In Proceed¬ 
ings of the 18th annual ACM SIGPLAN conference on Object-oriented 
programing, systems, languages, and applications, OOPSLA ’03, pages 
115-134, New York, NY, USA, 2003. ACM. ISBN 1-58113-712-5. . 
URL http://doi.acm.org/10.1145/949305.949317. 

G. Huet. Resolution d'equations dans les langages d’ordre 1,2,... ,u. 

PhD thesis, Universite de Paris VII, 1976. 

G. Huet. Confluent reductions: Abstract properties and applications to term 
rewriting systems. J. ACM, 27(4):797—821, Oct. 1980. ISSN 0004-5411. 
. URL http: //doi. acm. org/10.1145/322217.322230. 

J. Jaffar. Efficient unification over infinite terms. New Generation Comput- 
ing, 2(3):207-219, 1984. ISSN 0288-3635. . URL http://dx.doi. 
org/10.1007/BF03037057. 

M. P. Jones. Type classes with functional dependencies. In G. Smolka, 
editor, ESOP, volume 1782 of Lecture Notes in Computer Science, pages 
230-244. Springer, 2000. ISBN 3-540-67262-1. 

O. Kiselyov, R. Lammel, and K. Schupke. Strongly typed heterogeneous 
collections. In Proc. 2004 ACM SIGPLAN Workshop on Haskell, Haskell 
’04, pages 96-107. ACM, 2004. 

J. Klop. Term rewriting systems. In Handbook of logic in computer science 

(vol. 2), pages 1-116. Oxford University Press, Inc., 1993. 

K. Knight. Unification: a multidisciplinary survey. ACM Comput. Surv., 21 
(1):93-124, Mar. 1989. ISSN 0360-0300. . URL http://doi.acm. 
org/10.1145/62029.62030. 

C. McBride. Faking it: Simulating dependent types in Haskell. J. Funct. 
Program., 12(5):375-392, July 2002. 


J. G. Morris and M. P. Jones. Instance chains: type class programming with¬ 
out overlapping instances. In Proceedings of the 15th ACM SIGPLAN 
international conference on Functional programming, ICFP ’ 10, pages 
375-386, New York, NY, USA, 2010. ACM. ISBN 978-1-60558-794-3. 
. URL http: //doi. acm. org/10.1145/1863543.1863596. 

M. H. A. Newman. On theories with a combinatorial definition of “equiv¬ 
alence”. Annals of Mathematics, 43(2):pp. 223-243, 1942. ISSN 
0003486X. URL http: //www. j stor. org/stable/1968867. 

U. Norell. Towards a practical programming language based on dependent 
type theory. PhD thesis, Department of Computer Science and Engineer¬ 
ing, Chalmers University of Technology, SE-412 96 Goteborg, Sweden, 
September 2007. 

T. Schrijvers, S. Peyton Jones, M. Chakravarty, and M. Sulzmann. Type 
checking with open type functions. In Proceedings of the 13th ACM 
SIGPLAN international conference on Functional programming, ICFP 
’08, pages 51-62, New York, NY, USA, 2008. ACM. ISBN 978-1- 
59593-919-7. . URL http://doi.acm.org/10.1145/1411204. 

1411215. 

M. Sulzmann, M. M. T. Chakravarty, S. Peyton Jones, and K. Donnelly. 
System F with type equality coercions. In Proceedings of the 2007ACM 
SIGPLAN international workshop on Types in languages design and 
implementation, TLDI ’07, pages 53-66, New York, NY, USA, 2007a. 
ACM. 

M. Sulzmann, G. Duck, S. Peyton Jones, and P. Stuckey. Understanding 
functional dependencies via constraint handling rules. Journal of Func¬ 
tional Programming, 17:83-130, Jan. 2007b. 

W. Swierstra. Data types a la carte. J. Funct. Program., 18(4):423—436, 
July 2008. ISSN 0956-7968. . URL http://dx.doi.org/10.1017/ 
S0956796808006758. 

S. Weirich and C. Casinghino. Arity-generic datatype-generic program¬ 
ming. In Proceedings of the 4th ACM SIGPLAN workshop on Program¬ 
ming languages meets program verification, PLPV ’ 10, pages 15-26, 
New York, NY, USA, 2010. ACM. ISBN 978-1-60558-890-2. . URL 
http://doi.acm.org/10.1145/1707790.1707799. 

S. Weirich, D. Vytiniotis, S. Peyton Jones, and S. Zdancewic. Generative 
type abstraction and type-level computation. In Proceedings of the 38th 
annual ACM SIGPLAN-SIGACT symposium on Principles of program¬ 
ming languages, POPL ’ll, pages 227-240, New York, NY, USA, 2011. 
ACM. 

S. Weirich, J. Hsu, and R. A. Eisenberg. Towards dependently typed 
Haskell: System FC with kind equality. In Proceedings of the 18th ACM 
SIGPLAN International Conference on Functional Programming, ICFP 
•13, Boston, MA, USA, New York, NY, USA, 2013. ACM. To appear. 

B. A. Yorgey, S. Weirich, J. Cretin, S. Peyton Jones, D. Vytiniotis, and J. P. 
Magalhaes. Giving Haskell a promotion. In Proc. 8th ACM SIGPLAN 
workshop on Types in Language Design and Implementation. TLDI ’ 12, 
pages 53-66. ACM, 2012. 


A. Description of units package 

Using closed type families, we have written a library units, 8 for 
strongly-typed dimensional analysis. For example, we want to write 
functions like this: 

curPos :: Pos -A Velocity —> Acceleration -a Time —» Pos 
curPos xo v a t = xo .+ (v .* t) .+ (0.5 *. a .* (t." pTwo)) 
The above code works with our library and type-checks. However, 
if we were to make an expression that does not respect physical 
units (say, by forgetting the t in the v .* t), we get a type error 
at compile time. For that particular case, the error says Couldn’t 
match type ’Meter’ with ’ Second’, rather helpfully. 

Importantly, this library is fully extensible. There are no wired- 
in units, except for Scalar. This way, users can apply the library to 
situations beyond just physics. For example, it might be sensible to 


1 cabal install units; you will need GHC 7.8. 
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have HPixel and VPixel units when writing a drawing program, to 
make sure that you don’t ever add a height with a width. 

In order to support extensibility, new units are represented by 
new datatypes, in kind *. For example, here are the definitions for 

data Meter = Meters 
instance Unit Meter where 
type BaseUnit Meter = Canonical 
data Foot = Feet 
instance Unit Foot where 
type BaseUnit Foot = Meter 
conversion Ratio _ = 0.3048 
type Pos = MkDim Meter 

It is the library’s extensibility that requires closed type families. 
It needs to reason about type-level structures without being able 
to enumerate all the possibilities, and without requiring the user to 
be well-versed in type families. There are two independent ways 
that closed type families are required in the design of the library: 
manipulating dimension specifications as type-level sets (which is 
similar to the example in Section 2.4) and managing the hierarchies 
of inter-convertible units. 

Building a hierarchy of units with a distinguished root In the 
definition of Meter and Foot above, we also defined their relation¬ 
ship. The code says that Meter is a canonical unit—that is, it is 
not defined in terms of something else. On the other hand, Foot is 
defined in terms of Meter, so that we can write code like 
height:: Double 

height = (1.8 % Meters) # Feet 
and height will have the value 5.9055. Of course, we could convert 
feet to meters simply by reversing the statement. 

Say a library has been written on top of units that defines 
several different length measurements, such as Meters, Feet, and 
LightYears. Now, a user of that library realizes that she needs to 
define Inches. She would like to define inches in terms of Feet, 
because she knows that conversion ratio. But, she doesn’t know 
which of the existing length units is the canonical one. Part of the 
design principle behind the units library is that she does not need 
to know—she can define Inches in terms of any of the available 
length units. 

With this design in hand, we still need a way to compute the 
conversion from our internal representation of a length—which 
will be in Meters, the canonical unit—to Inches. We can see that 
the declared units form a tree, rooted at Meters, and each new 
unit refers to its BaseUnit, or parent in the tree. To find the right 
conversion ratio, we simply have to walk up the tree from the 
desired unit, multiplying all of the conversion ratios together. 

But, how to implement this in Haskell? Recall that this tree 
is a tree of types, which are erased at runtime. We should use a 
class Unit that defines the conversion ratios, and we can have an 
associated type BaseUnit the defines a unit’s parent in the tree. We 
introduce an empty type Canonical to serve as a canonical unit’s 
(i.e.. Meter’s) parent, or BaseUnit. Then, we can (seemingly) 
implement the conversion ratio calculation straightforwardly: 
class (Unit (BaseUnit u)) => Unit u where 
type BaseUnit u :: * 
conversion Ratio :: u —> Double 

— ratio from u to u’s parent 
canonicalConvRatio :: u -» Double 

— ratio from u to canonical unit, 

— with default implementation 
canonicalConvRatio u 


= (conversionRatio u) * 

(canonicalConvRatio (_L :: BaseUnit u)) 

(The instance for the Canonical type breaks the recursion in 
canonicalConvRatio by overriding the default definition.) 

There is a major problem with Unit as defined here—it has 
a superclass cycle. The header states that every Unit’s BaseUnit 
must also be a Unit, which is clearly ill-founded. Yet, this idea is 
sensible, because we need to be able to call canonicalConvRatio 
on a BaseUnit. What to do? 

The full answer would take up too much space to describe (and 
is available if you download the units package), but it boils down 

type family CheckCanonical ( unit:: *) :: Bool where 
CheckCanonical Canonical = True 
CheckCanonical unit = False 
Using CheckCanonical, we can define a conditional constraint, 
essentially saying that every non-canonical unit must have a unit as 
its parent. This breaks the type-level recursion and brings us back 
onto solid footing. 

It is never wise to say that an alternate encoding is impossible 
in Haskell, but we were unable to find another one that works 
smoothly and presents a very easy interface to users. 

B. zipWith with inferred arity 

Using the CountArgs closed type family from Section 2.3, we can 
define a variable-arity zipWith function that infers the correct arity 
from its first argument. 

We first need a definition of the natural numbers. This definition 
will only be used as a promoted data kind. 
data Nat = Zero \ Succ Nat 

In our description, we will abbreviate these unary numbers with 
ordinary decimals. 

What will the type of our final zipWith be? It will first take a 
function and then several lists. The types of these lists is determined 
by the type of the function passed in. For example, suppose our 
function f has type Int —> Bool —> Double, then the type of 
zipWith should be ( Int —> Bool —>■ Double) —> [Int] —r 
[ Bool] [ Double]. Thus, we wish to take the type of the function 
and apply the list type constructor [ ] to each component of it. 

Before we write the code for this operation, we pause to note 
an ambiguity in this definition. Both of the following are sensible 
concrete types for a zipWith over the function f: 
zipWith :: (Int —»• Bool —»• Double) 

—> [Int] —> [Bool —r Double] 
zipWith :: (Int -¥ Bool —> Double) 

—¥ [Int] —> [Bool] —¥ [Double] 

The first of these is essentially map\ the second is the classic 
function zipWith that expects two lists. Thus, we must pass in the 
desired number of parameters to apply the list type constructor to. 
(The inferred arity comes in later.) The function to apply these list 
constructors is named Listify: 

type family Listify (n :: Nat) arrows where 
Listify Zero a = [a] 

Listify (Succ n) (a —» b) = [a] — > Listify n b 
We now need to create some runtime evidence of our choice 
for the number of arguments. This will be used to control the 
runtime operation of zipWith —after all, our function must have 
both the correct behavior and the correct type. We use a GADT 
NumArgs that plays two roles: it controls the runtime behavior as 
just described, and it also is used as evidence to the type checker 
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that the number argument to Listify is appropriate. After all, we 
do not want to call Listify 2 ( Int — > Bool), as that would 
be stuck. By pattern-matching on the NumArgs GADT, we get 
enough information to allow Listify to fully reduce, 
data NumArgs :: Nat — > * — > * where 

NAZero :: NumArgs Zero a 

NASucc :: NumArgs n b -4 NumArgs (Succ n) (a -4 b ) 
We now write the runtime workhorse listApply, with the fol¬ 
lowing type: 

listApply :: NumArgs n a -4 [a] -4 Listify n a 
The first argument is the encoding of the number of arguments 
to the function. The second argument is a list of functions to 
apply to corresponding elements of the lists passed in after the 
second argument. Why do we need a list of functions? Consider 
evaluating zipWith (+) [1, 2] [3,4], where we recur not only on 
the elements in the list, but on the number of arguments. After 
processing the first list, we have to be able to apply different 
functions to each of the elements of the second list. To wit, we need 
to apply the functions [(1+), (2+)] to corresponding elements in 
the list [3,4], (Here, we are using Haskell’s “section” notation for 
partially-applied operators.) 

Here is the definition of listApply: 

listApply NAZero fs = fs 

listApply (NASucc na) fs = 

\args —> listApply na (apply fs args) 
where apply :: [a -4 b] —> [a] —t [b] 

apply (f :fs ) (x : xs) = (f x : apply fs xs) 
apply _ _ = [] 

It first pattern-matches on its first argument. In the NAZero case, 
the list of functions passed in has 0 arguments, so we just return 
them. In the NASucc case, we process one more argument (args), 
apply the list of functions fs respectively to the elements of args, 
and then recur. Note how the GADT pattern-matching is essential 
for this to type-check— the type checker gets just enough informa¬ 
tion for Listify to reduce enough so that the second case can expect 
one more argument than the first case. 

Inferring arity As explained in Section 2.3, here is the closed 
type family that counts the number of arguments in a function type: 
type family CountArgs (f :: *) :: Nat where 
CountArgs (a -4 b) = Succ (CountArgs b) 

CountArgs result = Zero 


zipWith :: V f. CNumArgs (CountArgs f) f 
=> f -4 Listify (CountArgs f) f 
zipWith fun 

= listApply (getNA :: NumArgs (CountArgs f) f) (repeat fun) 
The standard Haskell function repeat creates an infinite list of its 
one argument. 

The following examples show that zipWith indeed infers the 

example 1 = zipWith (A) [False, True, False] [True, True, False] 
example 2 = zipWith ((+) :: Int -4 Int -4 Int) [1,2,3] [4,5,6] 
concat:: Int -4 Char 4 Double -4 String 
concat a b c = (show a) -H- (show b) -H- (show c) 
example 3 = zipWith concat [1,2,3] [’a’, ’b’, ’c’] 
[3.14,2.1728,1.01001] 

In example 2 , we must specify the concrete instantiation of (+). 

In Haskell, built-in numerical operations are generalized over 
a type class Num. In this case, the operator (+) has the type 
Num a => a -4 a -4 a. Because it is theoretically possible 
(but deeply strange!) for a to be instantiated with a function type, 
using (+) without an explicit type will not work—there is no way 
to infer an unambiguous arity. Specifically, CountArgs gets stuck. 
CountArgs (a -4 a -4 a) simplifies to Succ (Succ (CountArgs a)) 
but can go no further; CountArgs a will not simplify to Zero, be¬ 
cause a is not apart from b -4 c. 


C. Typing judgments for System pFC 

| hgnd £ | Ground context validity 

:- Gnd_Empty 


b gnd £ tf#£ 
b gnd £, H:k -4 * 
b gnd £ F#£ 


Gnd.Ground 


b gnd Y.,F(k):k' 
F(k):k' <E £ 
£;aiKhfrp:K C#£ 

£; oc.K by v : K' f| nd £ 
b gnd £, C:fp], F(p) ~ r,' 


£ bar A Variables context validity 


We still need to connect this type-level function with the term- 
level GADT NumArgs. We use Haskell’s method for reflecting 
type-level decisions on the term-level, type classes. The following 
definition essentially repeats the definition of NumArgs, but be¬ 
cause this is a definition for a class, the instance is inferred rather 
than given explicitly: 

class CNumArgs (numArgs :: Nat) (arrows :: *) where 
getNA :: NumArgs numArgs arrows 
instance CNumArgs Zero a where 
getNA = NAZero 
instance CNumArgs n b =$■ 

CNumArgs (Succ n) (a b) where 
getNA = NASucc getNA 

Note that the instances do not overlap; they are distinguished by 
their first parameter. 

It is now straightforward to give the final definition of zipWith, 
using the extension -XScopedTypeVariables to give the body of 
zipWith access to the type variable f: 


bgnd £ 

£ bar • 

£;Ab v t:k x#A 


£ b var A, x:t 
£ bar A a # A 


£ b ar A, au 

btx T | Context validity 


Var_Empty 

Var.TermVar 

Var.TypeVar 


rbn 


£ bar A 
b ctx £; A 


Ctx Valid 


Expression typing 


x:t € A b ctx £; A 
£; A bm £ : t 

T, x:tl bm e : 42 
T bm Xx:Ti.e : n -> T2 


Tm_Var 

Tm_Abs 
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r htm ei : n -» r 2 £ Km e2 : n 
r him ei e 2 : r 2 


Tm App 


r.QIKhm e IT 
r htm A a:n. e : V ol'.k.t 


Tm.TyAbs 


r him e : V cc.k.tz r hty ri : k 
r hf m en : r 2 [n/a] 


Tm TyApp 


Chi' <E E - jp ^ [A:k ]. 
hgnd s ? = _ t' ~ V 0 fai] 

V j < *, no_conflict(1',«, t/i, j) 
EFC[F(t)]-C[t'] 

Figure 8. The type rewriting rule 


r h-co 7 : T1 ~ T2 r htm e : ' 
T hfm e > 7 : r 2 
T hty r :~/tj Type kinding 

a-.K € A h ctx E; A 


E; A hty a : k 


F(k):k' € E bet. E; A 

E;A ht yT-.K 


H : 


E; A hty F(t) : k’ 

7 * € S hctx E; A 
E; A h ty H : k -a * 

r hty n : * r hty T2 : * 

r hty Ti r 2 : * 


Ty_Var 

Ty.TyFam 

Ty Ground 

Ty_Arrow 


The second fact above is immediate from the fact that the vari¬ 
able /3 must not be free in <j>, invoking the Barendregt variable con¬ 
vention and noting that /3 is introduced separately from any of the 
variables in scope in <f>. 

Thus, we must only show V j < i, no_conflict(1', «, 

Thus, given j < i ("and knowing no conflictfl, i.T.j)), we must 
show no_conflict('P, i, r[a/P], j). We proceed by case analysis on 
no_conflict(\F, i, r,j ): 


Case NC A pari: We must show only that a part (p,, p, [t[< 7 / d\/a,]), 
assuming apartfph, p, [r/a,]). The result is immediate after in¬ 
voking Property 12, with Q ffrand noting that f) cannot 

be free in ~pi- 

Case NC-Compatible: We note that r appears nowhere else in 
the premises of this rule. Therefore, changing r has no effect, 
and we are done. 


T, a:K h[y r : * 
T h[y V cc.k.t : * 


Ty_Forall 


□ 


r hty Ti : Ki —> K2 r hfy r 2 : Ki 
r hty T1 T2 : K1 


Ty_App 


D. Proof of substitution lemma 


E. Proof of consistency 

As described in Section 5.3, we use a rewrite relation, defined in 
Figure 8, show that it is complete with respect to E; A h co 7 : 
n ~ r 2 , and then conclude that E must be consistent, as rewriting 
preserves non-type-family head forms. 


The kinding judgment for types, the proposition validity judgment, 
and the context validity judgments are all mutually recursive. They 
all support a standard substitution lemma, which we do not prove 

Lemma 20 (Type substitution). Assume r hty <r : k. Then, the 
following are true: 

1. 7/r, ar.K, A hty t : k, then F, A[cr/a] hty r[o/a] : k. 

2. //h ct x r, a-.K, A, then h- ctx T, A[tr/a]. 

3. If r, a-.K, A hprop <f> ok, then T, A [<j/a] h prop <j>[o/a\ ok. 

Lemma (Co_Axiom Substitution [Lemma 15]). // E: A, (3\k, A' h^ 
C[i\ T : F(p^m) ~ V iffc] and S;A h ty a : k, 
then E; A, A'\<r/0\ h co C[i\ t[<t/0\ : F(pi[T/<*][<?/ft) ~ 

]wm- 

Proof. We invert E;AJi«,A' % : F( Pi [rJ(Xi]) ~ 

Vi\r/ai\ to get the following: 

• CiV € E 

• 9 = [ma]. fCy* 

• E^liKyA' hty r 

• h tx E; A, P'.k, A' 

• V j < i, no-conflictfl/ i,f, j) 


Lemma20gives SpA^^ and h ctx E; A, A'[cr//3], 
Let (j) = F(p) ~ v. It now remains only to show that V j < 
i, no_conflict(1', i, t[<j/ f)\,j) and0[r/ai][cr//3] = 0[r[cr//3]/ai], 
and then we can use Co_Axiom to get the desired result. 


Type contexts Throughout this proof, we use a notion of type 
contexts, or types with holes. The notation C\f\ denotes a type 
with exactly one hole in it. Similarly, C[-J denotes a type with any 
number of holes (possibly 0) in it. We generalize these definitions 
to lists, saying that £ [•] denotes a list of types with exactly one hole 
(in one specific type, not one hole per type) and that £[•] denotes 
a list of types with any number of holes. 

E.l Rewrite relation 

The only form of reduction is type family simplification, using 
the same no_conflict judgment that appears in the Co_Axiom 
rule. The use of C[\ in the conclusion states that a type family 
application can reduce anywhere within the structure of a type. 
As C[-] denotes a type context with exactly one hole, only one 
type family reduction happens in one step. Note that this rule is 
nondeterministic. 

We use the notation E h o\ -"F 0-2 to mean the reflexive, 
transitive closure of the relation E F • ■w •. We write single-step 
joinahility of 01 and <r 2 as E h <71 <=> <r 2 ; this fact holds whenever 
there exists 03 such that E h 01 03 and E h cr 2 <73, or 

E h 01 <7 2 , or E h cr 2 a\, or cri = cr 2 . General joinahility 

is written E h 01 cr 2 ; this fact holds whenever there exists 03 
such that E h 01 03 and E h cr 2 <73. 

We generalize the relation to hold over lists of types, written 
E h t a, to say that the list <7 is identical to the list T except for 
one element which takes one step. We also say Ehf^>*ff, which 
is identical toEhr o. 

Definition 21 (Confluence). Our rewrite system is confluent if, for 
all 00 , cri, and cr 2 such that E F cri and E h <70 cr 2 , 

E -k <7 1 cr 2 . 
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In order to show the completeness of the rewrite relation for 
transitivity coercions, we need to show the transitivity of the join- 
ability relation—that is, that E b ai <=>* 02 and E b <72 <73 

implies E b <71 <73. This fact requires confluence of the rewrite 

system. 

E.2 Local confluence 

Newman’s lemma (Newman 1942) states that a terminating rewrite 
system is confluent if it is locally confluent. 

Definition 22 (Local confluence). Our rewrite system is locally 
confluent if, for all 00, 01, and 02 such that S' :p < 7 o 01 and 

E b cro <72, then E I- <71 O* <72. 

A diagrammatic presentation of different confluence properties 
is in Figure 6. 

Because we have assumed termination, we need only show local 
confluence to show confluence. As usual, we will need a small 
menagerie of supporting lemmas before we can get to the main 

Lemma 23 (Stability of choice of substitution of lists). Ifr[o/a] = 
t[o' /oi\ and all the a are free in r, then a = a'. 

Proof. By induction on the structure of t: 

Case t = a: It must be that Spfe a, a one-element list. Thus, we 
know that <7 = <7 and o’ = o'. The given equality reduces to 

Case r = < 7 i —» <72: Divide the variables a into three groups: 

• are the variables free in <71 but not free in <72, 

• /J2 are the variables free in <72 but not free in <71, and 

• /?3 are the variables free in both <71 and <72. 

Divide <7 and o' accordingly. Then, we can use the induction 
hypothesis to get that <71. <73 s= cr\. o' a and that 0-2, <73 = 
<72, <73. Thus, we can conclude that <7 — o' as desired. 

Cases r = V a:K.v, t = v 1 V2, and r = F(v): Similar. 

Case t = H: The list of variables a must be empty, as must be <7 
and <7', so we are done. 

□ 

Lemma 24 (Stability of choice of substitution of lists in lists). If 
r[cr/a] = r[<7'/a] and all the a are free in T, then o = o'. 

Proof. By induction on the length of r, appealing to Lemma 23 and 
using logic as above to manage the free variables. □ 

Lemma 25 (One step/one hole context substitution). If E 
t' , then E h C[t] C[t% 

Proof. Straightforward induction on the structure of C\-\. □ 

Lemma 26 (One step/many holes context substitution). If B p 
r t', then E I- Cfr\ C[r']. 

Proof. Straightforward induction on the structure of C[-J. □ 

Lemma 27 (Multistep/many holes context substitution). If E F 
r t', then E h C[r] C\t'\. 

Proof. Straightforward induction on the length of the reduction 
E t- r r', appealing to Lemma 26. □ 

Lemma 28 (One step/one variable substitution). If E h r r', 
then E I- o[r/a] <7[r'/a]. 

Proof. By Lemma 25. □ 


Lemma 29 (One step/list of variables substitution). //E I- r 
then E h cr[r/a] o[t'/ a]. 

Proof. Straightforward induction on the list 7, using Lemma 28. 

□ 

Lemma 30 (Multistep/list of variables substitution). If E h 
then E h <7 [7/a] o[t'/o\. 

Proof. Straightforward induction on the length of the reduction 
E h 7 Hgpl', appealing to Lemma 29. □ 

Lemma 31 (One step linear type pattern anti-substitution). If a is 
the set of free variables in linear pattern p and E I- p[o/a\ 

'$p'].cx], then Eh?^ o'. 

Proof. By induction on the structure of p, where the linearity as¬ 
sumption is needed when dividing up the variables and combining 
the results when appealing to multiple induction hypotheses. □ 

Lemma 32 (Multistep linear type pattern antr-substitution). If a is 
the set offree variables in linear pattern p and E h p[o/a\ p[<7'/a], 

then Eh o o'. 

Proof. By induction on the length of the reduction E h p[o/a ] p[<7'/a], 

appealing to Lemma 31 in the inductive case and Lemma 23 in the 
base case. □ 

Lemma 33 (Multistep type pattern anti-substitution). If<x is the 
set of free variables in pattern p and E h p\o/a\ p[<7'/a], then 
Eb?%. ' 

Proof. Let p' be the result of replacing all variables in p with fresh 
variables. Thus p' is a linearized version of p. Let the set of free 
variables in p' be a'. We can see that for some list of types ip, 
p[cr/a] = p'[^/cC]. (The list of types i/> is just like <7 but with some 
repetitions to account for the linearization.) Similarly, we have 
p[o'/a] S* jjljBf gg . Thus, we know E h p'[ip/a'] * 

We then appeal to Lemma 32 to get E I- ip~** if/. Recall that 
this notation means that Thus, we can conclude that 

E h <7 o’ (because each ip and ip' has an equal o or o') and 
then E h o'. ‘ □ 

Lemma 34 (Local confluence). If Good E, the rewrite relation 
E ;§- • • is locally confluent. 

Proof. We assume E h cro <7i and E h <70 cr 2 and we must 
find <73 such that E h <71 <73 and E h <72 <73. We proceed 

by induction on the structure of oq. 

Case <70 = Ti — ¥ T2’. Inverting E I- <70 <71 and E h <70 ^ 02 

tells us that (n —y T2) = C\ [/*\Ct 7 )] and (n —>■ T2) = 
C2[F2(ip2)\, with <71 = Ci[tpi\ and 02 = C2[ip2\- We now do 
case analysis on Ci [•] and C2 [•]: 

Case Ci [•] = Ci[-]_]_-)• t 2 ,C 2 [-\ = &[■] —y r 2 : Note that Cj [Li(^j)] W 
T1 = Ci7*2 (l< 2.)]• Therefore, using the other conditions 
known from inverting the original steps from <70, we know 
that E m and E * • n ti 2 , where m = C[ [ip'x\ 

and T12 = Ci[r/4], Use the induction hypothesis to get T13 
such that E I- m ■w* T13 and E h T12 713. Then, by 

Lemma 27 to lift this result back to n —r T2, we are done, 
showing that <73 = T13 —> T2. 

CaseC[-] = Cl [•] r 2 ,C 2 [-] = n -»■ Ci[-]: Let r| = C[[ipi] 

and rl = Ci[Di], Then, o\ = t[ —> T2 and <72 = n —» ri 
with E h n ^ rj and E h 72 -*> rl. We let <73 = r[ —¥ Ta, 
and we are done. 
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Other cases: Similar to the cases above. 

Case <7 0 = V o/.k.t: Similar to the case for n —> n. 

Case cto = ri T 2 ‘. Similar to the case for n —> T 2 . 

Case cto = F(v): Inverting S h cto ^ cti and E b cto <72 
gives us cto = C'[F'(t)] and ct 0 = %"[F" (?*)]. If * ' 
and C"[] 7 •, then we are in a case similar to the case for 
n —► T 2 , and we simply use induction. Otherwise, we are left 
with three cases: 

Case C '[•] = F((C[-]),C"[-] = •: In this case, |bSs <C[F'(t)]. 
Let t' be the top-level reduct of F'(t). Thus, cti = 
F((C[t']). Let v 7 = <E[t']. 

We also know that E b F(v) CT 2 by a top level reduction. 

Inverting gives us the following: 

• Ch'P J^E_ 

• >fb== 

• v - 

• CT2 = <Ti[lp/0ti] 

• Vj < i, no_conflict('&, i, ip,j) 

We want to find a common reduct of ct 2 (which might not 
be headed by F) and F(v'). Thus, we must find a way to 
reduce F{v’) at the top level. We now use Property 14 to 
get v" such that Ebb v", v" — fi'(p,) f° r some fi', 
and for every p' such that apart(p', v), apartfpb v"). 
Instead of reducing F(v') directly, we step F(v') to F(v") 
(getting S b F{v')~>* F(v") from repeated application 
of Lemma 27) and then show that F(y") can reduce at the 
top level by the same equation at F\S) reduced to form < 72 . 
Thus, we must prove that v" = pi[ip' /a,] (for some if)') 
and that, for all j < i, no conflict('P, t,ip',j). 

We know that v" = fillfH). We also know that, by assump¬ 
tion, the free variables in v" are distinct from the free vari¬ 
ables in ~fh. Thus, O' must map every free variable in ~pl to 
some other type. Thus, we have v" = p,\ip'/a.,] for the ip' 
taken from the range of O'. 

We then perform inversion on the known facts that, for all 
j < i, no_conflict(T r , We now fix j, and repeat this 

argument for all j < i: 

Case NC_Apart: We see that apart(j%,jp,-[^/« t ]). From 
Property 14, we see that apart(p7, pi [ip 1 /ai ]) as desired. 
Case NC_Compatible: The check com pat ('!'[*],'!> §||;- 
does not depend on the types ip or ip', and thus we are 

Thus, F(v") reduces at the top level to <73 =f$M§lp|ti]. 
It remains to show that <72 (which equals <j[[ip / on]), the 
initial top-level reduct of Fte) reduces to (73. We know that 
E b pi[ip/oci]™** pi{ip'/&i\. Thus, by repeated application 
of Lemma 33 (and appealing to clause 2 of Good to show 
that every ip |r ip is considered), we get E b ip ip'. By 
Lemma 30, we can conclude E b <j'i[ip/ai\ a'i[ip'/ai\, 

CaseC'[-] = -,£"[•] = F((F." [■])•. Similar to the case above. 
CaseC'[-] = C"\-\ = •: We will show a stronger property 

than local confluence in this case; we will show that if 
E b F(t) vi and E b F(r) V 2 , both at the top 
level, then vi = u 2 . 

We invert both reductions to get the following facts, along 
with h| nc | E: 


from E b F(r) vi 

from EbF(r)-t « 2 

€ E 

C I 2 :'I , 2 € E 

T'l = [ai:«i]. F(pf) ~ ct( 

^2 = [a 2 :K2 ]- Ffpf) ~ a' 2 

r = ’ 

t = 

vi= 'i&mm 

V2 = 

Vfe < i, 

Vfc < j , 

no_conflict(^i, 

no_conflict (^ 2 ,j,ip 2 ,k) 


Thus, we must show that ct(, 02 j §1 %fot 2 j]. 

From clause 3 of Good, we see that either i = j = 0 (open 
family) or Ci = C 2 (closed family). We will tackle these 
cases separately: 

Open family: In this case, the axioms C\ and C 2 have 
one equation each and thus we simply drop the i and j 
subscripts. Let $1 and $2 be the the equations of C\ and 
C 2 , respectively. 

We know from the inversions that pi [ipi/ai] = pi\}pila.^. 
Let 0 2 — f-+ i/’ 2 ] We can see that 0 2 is a 

unifier of pT and JF>- Then, clause 4 of Good tells us that 
compat(4>i, 4> 2 ). Here, we have two cases: 

Case Compat.Coincident: We know that fi is a most 
general unifier of pT and p 2 (appealing to Property 11) 
and 0 (ctJ) = 0(ct 2 ). Thus, there must be some O' such 

that O 2 = O' o O. _ _ 

We must show that a\\ipi/oti\ = ct 2 [ip 2 / 02 ]. This 
equation is equivalent to 0 2 (cri) = f^cr^), which in 
turn is 0'(0 (cti)) = 0'(0(ct 2 )). But, we know that 
O(ctJ) = 0 ( 172 ), so we are done. 

Case COMPAT Distinct: We know that unifyfpT, pi) fails. 
Yet, we have 0 2 as a unifier of these types. Appealing to 
Property 40, we have a contradiction, and thus this case 
cannot happen. 

Closed family: We know C\ = C 2 and, by h^ n d E, there 
can be only one axiom of the same name in the context, so 
'Pi = 'P 2 , and thus we can drop the 1 and 2 subscripts, 
except on the ip, which do not appear in the axiom types. 
Thus, we must show-a^f^ya,] - a'j[ip 2 /a.j\. 

Now, we must examine the indices i and j. If i = j, 
then we are done by an application of Lemma 24, using 
p[ipi/a ] = p[ip2/a\ and clause 2 of Good. So, we as¬ 
sume, without loss of generality, that i > j. Inverting 
no conflict('P, j) leads us to three cases: 

Case NC Apart: We see here that apart]pj, pi[ipi/ai]). 
Yet, we know from the original inversions that p, [ipi /«,] = 
Pj[ip2/oij]. The substitution [a, ip 2 ] is then a unifier 
of the two types that we know are apart, leading to a 
contradiction, appealing to Property 13. Thus, this case 
cannot happen. 

Case NC_Compatible/Compat_Coincident: Here, 
we know that fi is a most general unifier (appealing to 
Property 11) for pi and pj and that fi(a' t ) = fi(a'). 
From the original inversions, we know Pi[ipi/ai] =' 
Pi\ip 2 / Let O 2 = [a* ipi, otj i-» ip 2 ]• We can say 
0 2 = fi’ o fi for some O'. We can rewrite our goal as 
showing that 0'(0(cr')) = 0'(0(cr')). This is imme¬ 
diate from the fact that O(ct') = O(ct' ), and so we are 
done. 

CaseNC COMPATiBLE/COMPAT Distinct: We know 
un 'fyoo(Pij ft) foils- Yet, we know from the original in- 
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versions that pi[ipi/a.i\ = pfifiii/af]. The substitution 
[cm i -¥ ipi,aij i->- i/j 2 ] is then a unifier of p) and flj, lead¬ 
ing to a contradiction, appealing to Property 11. 

□ 

Lemma 35 (Confluence of terminating systems). If Good E and 
E fr • • is a terminating rewrite relation, then it is confluent. 

Proof. By appealing to Newman’s lemma (Newman 1942) and 
Lemma 34. □ 

E. 3 From confluence to consistency 

Lemma 36 (Transitivity). //E h ■ ^ ■ is a terminating rewrite 
relation, Good S, E h n t 2 and E b t 2 T3, then E h 
n T3. 

Proof. By Lemma 35. □ 

Lemma 37 (Congruence). IfY, b n «=>* r 2> then E b C[ti] «=>* C[r 2 ], 

Proof. By appealing to Lemma 27. □ 

Lemma 38 (Completeness). If T. h is a terminating rewrite 
relation, Good E and E: A ‘ co " : cri ~ <7 2 , then E ! cri -44-* cr 2 . 

Proof. We proceed by induction on E; A bo 7 : cri ~ (72: 

Cases Co Arrow, Co Forall, Co App, and Co TyFam: 

By the induction hypothesis, appealing to Lemma 37 and 
Lemma 36. 

Cases Co_Refl, Co_Sym, and Co_Trans: From the fact that 
E b • 44* • is an equivalence relation, appealing to Lemma 36. 
Cases Co Left and Co Right: The induction hypothesis gives 
us that E b n r 2 44 * <7i 02 ■ We can see that any reduct of a type 
application must also be a type application. Thus, the common 
reduct must be Vi u 2 (for some v \ and vf) where v\ joins T\ 
and 01 and v-i joins r 2 and cr 2 . Thus, we are done. 

Case Co_Axiom: From claus e 1 of G ood and the Co_Axiom 
rule, we know that 01 = F(yi[p/oti\) and that cr 2 = v'i\p/ai\. 

We conclude that E b cri -^02, as the premises of the rule 
Red are all given by the premises of the rule Co_Axiom. 

□ 

Lemma 39 (Consistency). //E b • • is a terminating rewrite 

relation and Good E, then E is consistent. 

Proof. A consistent coercion equates two types with the same 
ground head forms. By Lemma 38, these two types must be join- 
able under the rewrite relation. Yet, the rewriting rule preserves 
all head forms except for type families. As type families are not 
ground head forms, we are done. □ 

F. Proof of properties of apart 

This appendix includes the proofs that our concrete definition of 
apart, as given in Definition 6, satisfies the properties stated in 
Section 5.1. Then, we show that these properties, along with the as¬ 
sumption of termination, imply the high-level (sanity-check) prop¬ 
erties from Section 3.2. It is well-founded to use our confluence 
result for these later proofs as those properties are not used any¬ 
where in other proofs—they simply serve as a higher-level check 
on our formal results. 


F.l Proofs of Properties 12-14 

We restate our implementation of apart: 

Definition (Apartness [Definition 6]). 
apart(p, t) = -iunify oc (p, flatten(r)) 

Recall that flatten (Definition 5) replaces all type family appli¬ 
cations in a (finite) type with fresh variables, maximally preserving 
sharing. That is, flattening the same type family application twice 
in the same type (or list of types) converts both applications to the 
same fresh variable. In order for flatten to be a well-defined func¬ 
tion, it must refer to a mapping from every possible type headed by 
a type family to fresh variables. This mapping is countably infinite, 
but we can assume, as usual, a countably infinite set of fresh vari¬ 
ables. Furthermore, we assume that the set of variables in the range 
of this mapping is distinct from variables used elsewhere (partic¬ 
ularly, in patterns). If this assumption is violated for some use of 
flatten, we simply rename the variables accordingly. 

The above definition of flattening with respect to an infinite 
mapping of type families to variables, means that flattening com¬ 
mutes with type constructors. For example, flatten(n —t r 2 ) = 
flatten(n) —t flatten(r 2 ). 

For completeness, we also restate the correctness of unification, 
but now for unify oc . 

Property 40 (unify ^ correct). If and only if there exists a substi¬ 
tution to (whose range may include infinite types) such that to (a) = 
co(t), then unify OQ (d ; ,r) succeeds, returning to. Furthermore, to is 
a most general unifier of a and r. 

Before getting to the properties themselves, we must prove 
some properties about flatten. First, we extend flatten to apply to 
substitutions and define an inverse operation: 

Definition 41 (Flattening a substitution). If 12 = [a t\, we 
say flatten (12) for [a i-> flatten (r)], where sharing is maximally 
preserved between the different types t. 

Definition 42 (Inverse flattening). We let flatten -1 denote the 
inverse operation to flattening, implemented by doing a reverse 
lookup in the map from type family applications to variables. 

Note that flatten -1 is a substitution, infinite in extent, but or¬ 
dinary in other respects. In particular, note that the elements in the 
range of flatten -1 are finite —that is, flatten -1 could be denoted 
by the metavariable 12. 

Lemma 43 (Flattened substitutions). For all type patterns p and 
substitutions 12, flatten (f2(p)) = (flatten (12) )(p). 

Proof. The pattern p contains no type families, so flatten does not 
affect the parts of p unchanged by the application of 12. Because 
flatten preserves maximal sharing, it must be the case that apply¬ 
ing a flattened substitution yields the same result as flattening an 
substituted pattern. This can be shown by straightforward induc¬ 
tion on p. □ 

Lemma 44 (Flattening commutes with substitution). For all 
12, there exists an 12' such that, for all r, flatten(12(r)) = 
12'(flatten(r)). 

Proof. We can say that 

flatten(12(r)) = flatten(12(flatten -1 (flatten(r)))) 

Because flatten -1 is a substitution, and appealing to Lemma 43 
(noting that flatten (r) is a pattern), we can rewrite this as flatten (12o 
flatten -1 ) (flatten(r)). Thus, we let 12' be the substitution flatten (12c 
fI atten -1 ) and we are done. □ 
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Lemma 45 (Flattening a list commutes with substitution). For 
all Cl, there exists an O' such that, for all t, flatten(fl(r)) = 
fi'(flatten(r)). 

Proof. By induction on the length of the list, appealing to Lemma 44. 

□ 

Property (Apartness is stable under type substitution [Prop¬ 
erty 12]). Ifapart(p,r), then for all substitutions Cl, apart(p, O(r)). 

Proof. Expanding definitions, we must show that 
-mnify^p, flatten (r)) 

implies 

-iunify oc) (p, flatten(fi(r))). 

We prove the contrapositive, that is, that unify ^ (p, flatten(0(r))) 
implies unify OQ (p, flatten(r)). Thus, we have a substitution ui 
such that w(p) = w(flatten(0(r))) and must find a to' such that 
w'(p) =w'(flatten(r)). 

By Lemma 45, we can say flatten(0(r)) = fl'(flatten(r)) for 
some O'. Then, choose to' = uj o Cl'. We can see that to'(p) = 
(flatten(r)) (noting that the variables in p are fresh from those 
in flatten(r)) as desired. □ 

Property (No unifiers for apart types [Property 13]). /fapartfp, r), 
then there exists no substitution Cl such that Cl(p) = O(r). 

Proof. Expanding definitions, we must show that ->unify(p, flatten(r)) 
implies -iunify(p, r). We will show the contrapositive. Thus, we 
assume Cl such that Cl(jj) = Cl(r) and we must find Cl' such that 
O'(p) = 0'(flatten(r)). 

Choose = flatten -1 . Because the free variables in p 
are distinct from the variables in the domain of flatten -1 , we have 
O'(p) = O(p). We also have 

Cl' (flatten (r)) = 0(flatten -1 (flatten(r))) = CI(t) 
and we are done. □ 

Property (Apartness can be regained after reduction [Property 14]). 

Ifr = O(p) and E I- t -** f>, then there exists a t" such that 

1. 

2. t" = Cl' (p) for some Cl', and 

3. for every p' such that apart(p', t): apart(p', r"). 

Proof. We know that f matches some pattern p and that one el¬ 
ement in t steps, forming t' . Suppose that one element is T k . 
Thus, E F Tk r(. Inverting this step relation gives us that 
Tk = C[F{y)\ and r' k = C[v'\, where F(v) reduces to v' at the 
top level. 

Define <C[-J to be the list of types r such that every occurrence 
of F(v ) is replaced by •. Thus, (C|[F(v)] = r. We choose r" 
(from the statement of the property) to be (Civ']. We must show 
the following: 

• Ehr'^’T": Straightforward application of the rule Red. 

O'(p) for some Cl': Because p cannot contain type fami¬ 
lies, it must be that Cl maps some variables to types containing 
F(v). Choose Cl' to be Cl with all occurrences of F(v) replaced 
by v' . Because all occurrences of F(v) in f have been replaced 
by v' , we can see that O'(p) must be r". 

• For every p 1 such that apart(p',r), we have apart(p',r"): 
Assume we have p' such that apart(p',r). Unfolding defi¬ 
nitions (and taking the contrapositive) gives us to such that 
u>(p*) = cu(flatten(r")), and we must find to' such that 
u'(pC) = w'(flatten(r)). 


Let a be the variable mapped from F(v). Thus, flatten(F(u)) = 
a. Let Do = [a >4 t/] and choose to' = to o O 0 . Noting that 
a does not appear in p', we see that to'(p') = to(p'). Now, we 
must only show that cc'(flatten(r)) = w(flatten(r")). By our 
choice of to', we know w'(flatten(r)) = w(f2o(flatten(T))), 
thus we must show f2o(flatten(r)) = flatten(r"). By its def¬ 
inition, flatten takes all occurrences of F(v) in T to a. Then, 

Oo takes all of these occurrences of a to v'. Since the only 
difference between r and t" is that all occurrences of F(v) 
are replaced by v', we can see that Oo (flatten (r)) is indeed 
flatten(r"), and we are done. 

□ 

F.2 Proofs of Properties 2 and 4 

Property (Apartness through substitution [Property 2]). //apartfp, r) 
then there exists no O such that match(p, O(r)). 

Proof. We shall prove by contradiction: assume O and O' such 
that Cl' (p) = O(r). We can simplify a bit and combine these 
substitutions, because the free variables of p are distinct from those 
in t; we can say Oo(p) = Oo(t). Then, this is a contradiction, 
appealing to Property 13, and we are done. □ 

The next property (Property 4) requires an important auxiliary 
lemma. 

Lemma 46 (Matching can be regained after reduction). TjfGood E 
and E F Cl(p)-^>* t then there exists an Cl' such that E h 
T^Cl'ip). 

Proof. Throughout this proof, we will consider types as abstract 
syntax trees. We will use “type” and “tree” interchangeably. 

Define the operation linearize to take a pattern and freshen all 
the type variables therein, thus producing a linear pattern. Our first 
step is to show that r matches linearize(p). How does Cl(p) step to 
r? It must be through a series of type family reductions. Because 
p does not mention type families, these type families must occur 
in Cl(p) at or beneath where variables appear in the tree p. Thus, 
as Cl(p) steps, the tree structure imposed by p does not change. 
However, it is possible that a type family application, say F(v) 
steps in two different ways throughout the tree Cl(p) as Cl(p) is 
reducing. Thus, we can claim only that r matches linearize(p), not 
p itself. 

When comparing the trees r and p, define a mismatch to be two 
locations in the respective trees where p has a repeated variable 
and r has two different suh-trees. Count only those matches that 
involve the left-most occurrence of a variable in p. We proceed by 
induction on the number of mismatches between p and r. 

Base case: If there are no mismatches, then we know that p must 
match r with a substitution Cl'. We are done. 

Inductive case: Choose the left-most mismatch. Say that the re¬ 
peated variable in p is a and the disagreeing types in t 
are oi and <72. We know that £ F Cl(p) r, and thus 
that E F fl(a) oi and E F fl(a) <72. By conflu¬ 
ence (Lemma 35), we know that there exists a <73 such that 
E F <7i ~»* (73 and E F (72 173. Let t' be r, except that 

both cri and 02 in r are replaced by 03 in r'. We know that 
E F r t' hy congruence of the rewrite relation and thus 
that E F Cl(p)~^>* t' . Thus, we can use the induction hypoth¬ 
esis to get O' such that EFt' Cl'(p). Then, by transitivity 
of E F • •, we are done. 

□ 
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Lemma 47 (Matching normal forms). If GoodE and E h 
f l(p) -vp* v where v is a normal form, then there exists Q! such 
thatv = fT(p). 

Proof. We apply Lemma 46 to see that there exists an Q' such 
that E I- v~^>* Cl'(p). But, we know that v cannot step, and thus 
« = «'(p). □ 

Lemma 48 (Longest reduction). Suppose Good E. For every type 
t and its normal form v (whose uniqueness is guaranteed by the 
combination of confluence and termination), there exists a number 
n such that all reductions from t to v are of length at most n. 

Proof Konig’s lemma states that every tree with infinitely many 
vertices, each having finite degree, has at least one infinite simple 
path. Here, we are considering trees of reductions, rooted at r. 
We will use the contrapositive of Konig’s lemma: that if every 
node in a tree has finite degree and all simple paths are finite, 
then there are finitely many vertices. For any type cr, there are 
finitely many types o' such that E h a o', because there 

are finitely many locations within cr that can be headed by a type 
family and finitely many equations that type family application 
might match. By termination, we know all simple paths in the tree 
of reductions are finite. Thus, the contrapositive of Konig’s lemma 
tells us that the tree has a finite number of nodes. Thus, we can 
simply enumerate all paths from r to v to discover the one with the 
longest path. This path’s length is our result n. □ 

Lemma 49 (Apartness and normal forms). //'apart(p, r) and E h 
t v where v is a normal form, then apart(p, v). 

Proof. Let the longest path from r to v be of length n (Lemma 48). 
We perform induction on n. 

Base case: Trivial. 

Inductive case: We know that r can step to some r'\ that is, 
E h r « f. We then appeal to Property 14 (choosing 
p = flatten(r), but the choice is irrelevant) to get t" such 
that Eh t' t" and apart(p, r"). By our assumption that 
n is the length of the longest path from r to v and the fact that 
E h r t" by at least one step, we know that the longest 
path from r" to v has length less than n. Thus, we can use the 
induction hypothesis, and we are done. 


□ 

Lemma 50 (Apartness implies no match). If apart(p, r), then 
->match(p, t). 

Proof. We prove by contradiction. Assume O such that Cl(p) = r. 
By the assumption that pattern variables are fresh, we can say 
Cl(p) = CI(t). Then, by Property 13, we have a contradiction. □ 

Property (Apartness through reduction and substitution [Prop¬ 
erty 4]). If apart (p,r), then for any t' such that r t': 
->match(p, t'). 

Proof. Let v he the unique normal form of r. By Lemma 49, 
we know apart (p,v). By Lemma 50, ^match(p, v). Note that 
the uniqueness of normal forms, we know E h r' v. By 
the contrapositive of Lemma 47, we see that ->match(p, r') as 
desired. □ 


G. Proof of compatibility soundness 

In this appendix, we show that the concrete implementation of 
compatibility (Definition 8) satisfies the definition of compatibility 
(Property 7). We use the implementation of compatibility included 
in our formal inference rules, as it separates Definition 8 into its 


mpat($i, $2) | Equation compatibility 


$1 = [«i:Kij. .F(pT) ~ 

$2 = [a-2-K2]. F(fid) ~ 
unify (pi,pd) = fi 
fi(oi) = n(u 2 ) 

compat(<E>i, $2) 


F(piitk 

= [a 2 :K 2 ]. F(p 2 ) 
ify(pT, P2) fails 


COMPAT_COINCIDENT 


Compat_Distinct 


We generalize Property 7 to work with unify^. 

Property 51 (Compatibility (with infinite unification)). Two type- 
family equations p and q are compatible iff to\(lhs p ) = u>2(lhs q ) 
implies ui(rhs p ) = ui2(rhs q ). 


Proof. For all type family equations $1 and $2, where $1 = 
!Oh:k.i]. -F(pT) ~ vi and $2 = [a 2 :feJ. Ffpd) ~ u 2 , we must 
show that compat('l>i, $ 2 ) implies that, for all wi and u>2 such that 
wi(pT) = w 2 (p2),itisthecasethatu>i(vi) = u> 2 (v 2). 

We have two cases: 


Case Compat.Coincident: Here, we know that w(pT) = 
w(p2) and, by Property 40, that w is a most general uni¬ 
fier. We further know that ui(v10(02). By assumption, 
wi(pi) = w 2 (p2). By the assumption that all patterns in type 
families have distinct variables, we know that the domains of ui 1 
and (02 are distinct. Thus, we can write to' = to 1 U 102, and say 
to'(pi) = to'(fjf). Similarly, we can say that we wish to show 
to'(v 1) = to'( u 2 ). Because to is a most general unifier, we can 
say that to' = to" o to for some to". Thus, we wish to show 
to"(to(v 1)) = to"(to(v 2 )). But, we know that to(v 1) = to(v 2 ) 
so we are done. 

Case Compat_Distinct: Here, we know that there exists no to 
such that toifii) = to(pd). Yet, we have assumed that 101 (pi) 
t02(pd) and by an argument similar to the last case, we can 
combine to 1 and w 2 to to'. This substitution to is then a unifier, 
leading to a contradiction. 

□ 
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